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SOME REFLECTIONS—STATISTICAL AND OTHER— 
ON A NON-MATERIAL UNIVERSE* 


By Aurrep J. Lorka 
Metropolitan Life Insurance Company 


Ss YOU KNOow, the exigencies of the war caused us to abandon our 
A plans for the regular annual convention in Cleveland. 

It had been my expectation to address you informally at a luncheon 
meeting, in that genial mood which comes as a sort of blessing after the 
meal—when the critical faculties are just a little dulled; when minor 
blemishes pass unobserved and major defects are more easily forgiven. 

But now I must transcribe my message. The lighter cargo of the 
spoken word must be brought to you in the heavier vehicle of the 
printed page. Nevertheless, light cargo it shall remain, for I have it on 
good authority that people can be fatigued without knowing it, but 
they cannot be bored without knowing it. And I shall try not to bore 
you. 

The statistician is in one respect—and in only one respect, I fear—a 
fortunate person. The Universe is his topic. He can discourse about any- 
thing and everything. I recall a drawing in an old American magazine, 
since discontinued. It pictured a class of undergraduates sitting for an 
examination. The professor was writing the questions on the black- 
board. The first question was: “What is, and why is it?” I think this 
must have been a class in statistics. 

It is not my purpose to attempt here a complete answer to the pro- 
fessor’s question. I have in mind to cover only one half of the Universe, 
a certain special half. Classification is an art that allows considerable 
freedom to those who practice it. Taking advantage of this freedom I 
shall for the present occasion divide all statistics into statistics of ma- 
terial, and statistics of non-material things; the latter, as we shall all 
agree, being by no means immaterial things. It is about these non- 
material statistics that I ask you to think with me for a little while. 

* This paper was to have been given as the Presidential Address at the 104th Annual Meeting 
of the American Statistical Association, to have been held at Cleveland, Ohio, December 1942. 

In response to a request from the Office of Defense Transportation to avoid adding to the traffic 


congestion during the holiday season the meetings were cancelled. This makes the first gap in an other- 
wise unbroken series of 103 Annual Meetings of the Association. 
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Although the non-material Universe is ever-present with us, or per- 
haps because of this very fact, we are inclined at times to ignore or even 
to question its presence. It is true that much of it is trivial. I refer to 
the Universe of thought. Every one of the millions of normal adult 
human inhabitants of this globe, day in and day out, throughout his 
waking hours, spins a thread of thought. The thought just now in Mrs. 
Jones’ mind may be nothing more important than the question where 
Mrs. Smith bought her hat; while Mr. Smith may be thinking of the 
$10 he won last night at cards from Mr. Jones. Statistics, even of such 
trivial thoughts, may not be wholly without value. Psychologists and 
psychiatrists may have an interest in them. But not all thought is triv- 
ial. There is much thought about the intensely practical affairs of life. 
And there is an aristocracy of thought. Even this is not immune from 
attack by the statistician. 

Anyone may, if he so chooses, conduct a statistical inquiry into his 
own thoughts, dealing directly with the non-material data themselves. 
But if the inquiry is to be made more objective, to cover thoughts of 
many individuals, some form of materialization of these thoughts must 
intervene; the statistics then are no longer quite strictly statistics of 
non-material things, but actually statistics of the material symbols 
that stand for these things. Such symbols may be spoken or written 
words or sentences, or they may be corresponding musical performance 
or notation. 

They may be spontaneous expressions, or they may be solicited 
answers to questionnaires or polls. As to these latter, it is interesting to 
note that here we meet with a phenomenon long familiar in this field, 
but only lately recognized in the domain of physical observations also, 
namely that the very act of interrogating alters the state of the object 
addressed, so that strictly speaking we can never by questioning be 
really sure of discovering the exact and true answer to the question 
posed. We note with regret that illustrations of this fact are easily 
gathered in any court of justice. 

The statistical analysis of expressed thought has a certain interest 
and utility from various angles. The practical man, who is primarily 
concerned with making life possible and economically profitable, has 
had occasion to play with statistics of both the spoken and written lan- 
guage. The printer, unless he casts his type as he goes, wants to know 
in what proportion to stock the various letters of the alphabet. For 
quite different reasons the cryptanalyst, in solving intercepted cipher 
messages, is interested in the frequency of these letters in different 
languages. 
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The telephone engineer has explored! the frequency of the principal 
words in colloquial English, since he is vitally interested in securing the 
best possible transmission, and this obviously requires a knowledge of 
the composition of spoken sounds. 

While his investigations were made primarily for purposes of practi- 
cal phonetics, in its application to telephone transmission, incidentally 
some of the by-products throw interesting sidelights in other directions. 
Our language is a population of words in which there is a sort of social 
stratification of function. Nouns, names of concrete things or abstract 
concepts, were found to be by far the most varied, nearly one half of 
the different words used—to be exact, 1,029 out of 2,240, in 500 tele- 
phone conversations. The number of different verbs were less than half 
this, namely 456. This, of course, is not accident, but has a meaning. 
The variety of things appearing in the universe of conversation analyzed 
was greater than the variety of happenings in which these things were 
involved. This is not surprising, for a typical statement or clause con- 
tains, associated with its verbal predicate, a subject, an object (if the 
verb is transitive), and in many cases further nouns qualifying the 
subject or object or the verb itself by an adverbial phrase. However, in 
these telephone conversations, the total occurrences of verbs (including 
repetitions, but exclusive of auxiliary verbs), namely 12,550, actually 
exceeded slightly the total occurrences of nouns, 11,660. While the same 
verb was repeated on an average eleven times in the total of nearly 
80,000 words, the same noun was repeated on an average less than three 
times. The relatively low proportion of total nouns is probably peculiar 
to telephone conversations, which on the other hand, are for obvious 
reasons very rich in pronouns, the words “I” and “you” heading the 
list of all words.? 

All this suggests that certain old-fashioned school exercises in gram- 
mar can be developed into something more interesting and of greater 
significance, in the form of quantitative parsing. This sort of study prac- 
tically amounts to a statistical appraisal of the furniture of the mind, 
insofar as this is reflected in language. 

True, not all words necessarily reflect useful mental furniture. So it 
has been remarked that the ether of the physicist of the 19th century 
may have been nothing more than the subject for the verb to undulate. 


1N. R. French, C. W. Carter, Jr., W. Koenig, Jr., “The Words and Sounds of Telephone Con- 
versations,” Bell Telephone System Technica! Publications, June 1930; N. R. French and W. Koenig, Jr., 
Journal Acoustical Society of America, 1929, p. 110; H. Fletcher (Bell Telephone Laboratories), Speech 
and Hearing, Van Nostrand, 1929. 

? Compare also Vernon Lee, The Handling of Words and Other Studies in Literary Psychology, 1923; 
see especially Chapter VI, pp. 187 et seq., L. W. Faucett, A Study of English Word Values, 
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The words mind, soul, may be just grammatical conveniences, subjects 
for such verbs as to think, to feel, etc. Such words, if not very useful, 
may nevertheless be relatively harmless. But they may also cause con- 
fusion, or worse, a sort of pseudo-scientific ponderosity. There is such 
a thing as “attempting to conceal truism behind pretentious termi- 
nology.” 

Singularly enough, the trap of verbiage, from which writers aspiring 
to profundity have not always escaped, has been quite clearly per- 
ceived by the comedian and the wit. So Moliére in his Le malade imagi- 
naire, has the candidate for a Doctor’s degree in medicine interrogated 


“Quare opium facit dormire?” 


to which he gives the illuminating reply 
“Quia est in eo virtus dormitiva.” 


More lately, and in less classic vein, that versatile author Anon has 
defined the elusive term “life”’—whose definition great biologists have 
discreetly sidestepped: 

There was a young man of Cadiz 
Who affirmed Life is what it is; 
For he early had learnt, 

If it were what it weren’t 

Life would not be what it is. 

The fact is that our language comprises, among others, two great 
classes of words. The first of these is what might be called Berlitz 
words, which can be defined, as in the Berlitz method of teaching for- 
eign languages, by pointing to an object. The second class is what I 
might call context words, the meaning of which (if any) can be ascer- 
tained only by examining the contexts in which they occur. Some of 
these latter words, undoubtedly, are mere counterfeits, or even slugs, 
which have been smuggled into the circulation of linguistic currency. 
Such, for example, is the word “ghost.” 

But these words of doubtful legitimacy are after all a minor part of 
our speech and writing. They represent a chapter in the morbid an- 
atomy of language. Exaggerations regarding these and other linguistic 
failings, to which a certain overpopularization of semantics has led 
several authors,’ will be avoided in any well-balanced study, for which 
a standard has been set by Helen Eaton’s‘ scholarly work. In an ap- 
pendix to her Semantic Frequency List for English, French, German and 

3 For criticism of this, see M. Schlauch, “Semantics as Social Evasion,” Science and Society, 1942, 
Vol. 6, p. 315. See also, by the same author, The Gift of Tongues, Modern Age Books, 1942. 


4H. Eaton, Semantic Frequency List for English, French German and Spanish (first six thousand 
words), Chicago University Press, 1940. 











SS. a 


— oa 


ee ae. 


= 











— oe -- 
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Spanish, she brings a statistical tabulation of the 6,000 most common 
words classified in natural categories, which may be thought of as the 
basis for an analysis of our conceptual universe. 

The parallelism between that universe and the universe of language is 
admittedly incomplete. For one thing, our English language, in particu- 
lar, is extremely rich in synonyms, two or more words denoting the 
same or closely similar concepts. A comparative study of this multipli- 
cation of terms in different languages would be of interest. We recall, 
for instance, that the French have no word for home, but must get 
along with a house or a hearth. The Frenchmen can distinguish only by 
mien, not by specific words, between his sentiment for Colette and his 
appreciation of a good cigar, or even of a piece of chocolate. But the 
English language, too, has its lacunae. The French vieillard, the German 
Greis, the Latin senex, and the Greek yépwy have no counterpart in 
English; though according to our custom we have appropriated the 
Greek root in naming a branch of science, gerontology, recently come 
into prominence on account of the aging of our population to which our 
vital statisticians have been drawing attention. 

Such lacunae in various tongues may be fortuitous. But wealth of 
synonyms is clearly enough ascribable to definite causes. English, un- 
doubtedly, owes its wealth in this field largely to its polyglot origin. 
But other factors also enter, irrespective of any particular language. 
Words with painful associations tend to run to synonyms. So in English 
we avoid reference to death and speak of passing away. We have three 
words, graveyard, churchyard, cemetery; the German speaks of Kirch- 
hof, Friedhof, Gottesacker. Examples could easily be multiplied. For 
more or less disreputable things slang often supplies a whole string of 
synonyms. Another motive for multiplying synonyms seems to be the 
desire to achieve and surpass superlatives: big, large, great, huge, im- 
mense, enormous, gigantic, monstrous, jumbo, etc.; and at the other 
extreme, small, little, petite, pygmy, puny, minute, diminutive, tiny, 
wee, etc. 

To the creative writer, the littérateur, this wealth of synonyms, allow- 
ing as it does fine shadings and distinctions of meaning, is a treasure. 
But there is also a diametrically opposite attitude. To a certain type 
of mind the ideal language would be one in which each word or symbol 
would have one single meaning, and in which for each meaning there 
would be a single word. On the construction of such a language, free 
from all ambiguity and redundancy, certain mathematicians have 
labored, with perhaps somewhat uncertain success. And there is also 
what might be called the economic philologist, who wishes to undo the 
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mischief of the Tower of Babel. Attempts to substitute highly efficient 
artificial languages for the inefficient natural tongues which, like Topsy, 
“just grew,” do not seem to have met with much success, and in fact 
have built a little Tower of Babel of their own. Ogden’s proposal of 
Basic English, which manages somehow to say most of what is worth 
saying, by the use of only about 850 words, seems to be a more practi- 
cal venture. A perhaps minor defect from which it suffers is a strange 
sort of irreversibility, in that any English-speaking person can read, 
but by no means write it. To do the latter he must first learn what 
words not to use—a singularly backhanded method of learning a lan- 
guage. The foreigner must learn some 800 English words. But the 
native must “unlearn” many thousands. 

I have remarked that the practical man, primarily concerned with 
making life possible and economically profitable, has had occasion to 
concern himself with statistics of language. But the artist and the 
scholar also, whose chief contribution is to make life pleasant and inter- 
esting, have paid some attention to, and certainly have supplied much 
material for, the statistics of thought expressed in sound and symbol. 
The statistician has not hesitated to apply his technique to Horace’s 
Odes, to Greek statues or to literary style. Among the latest contribu- 
tion in the latter class are two articles in Biometrika, the first by G. U. 
Yule,* followed soon after by a more detailed article by C. B. Williams.” 
The latter compared the frequency distribution of sentence length in 
Bernard Shaw, G. K. Chesterton and H. G. Wells. A good many years 
ago, I myself started what I might call an Inventory of the Poet’s 
Mind. From concordances of Wordsworth, Milton and Keats, I noted 
the frequency of a considerable number of words. For example, the 
relative number of mentions of father, mother, sister, brother, in 
Wordsworth was in the ratio of 37 to 35 to 10 to 18; in Milton the ratio 
was 60 to 20 to 7 to 13; in Keats, 25 to 16 to 28 to 31. I did not continue 
this investigation when I found that this kind of thing had already been 
done by others. So, for example, Niceforo in his Méthode Statistique, 
quotes a study of the frequency of mention of various colors in the 
sonnets of Baudelaire, and gives many examples of the application of 
statistics to literature and art, among others a study of the distribution 
of sentence length in Balzac. 


5 Both the First World War and the present war have been the occasion for a revival of the move- 
ment for an international language. In this connection see Nature, 1942, Vol. 150, pp. 622 and 718; 
1943, Vol. 151, p. 80. 

* G. U. Yule, “On Sentence Length as a Statistical Characteristic of Style in Prose: With Applica- 
tion to Two Cases of Disputed Authorship,” Biometrika, 1939, p. 363. 

7 C. B. Williams, “A Note on the Statistical Analysis of Sentence Length as a Criterion of Literary 
Style,” Biometrika, 1940, p. 356. 
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A recent contribution is a statistical study of sound patterning in 
poetry by B. F. Skinner,* who has also examined the frequency dis- 
tribution of response words in association tests. Plotting the logarithms 
of frequencies as ordinates against the logarithms of ranks as abscissae 
(the most frequent word has rank one, etc.), he found that over the 
major part of the range the data fell essentially along the straight line. 
This relation, however, is not restricted to words collected by the proc- 
ess of association. It appears to be the general characteristic of words 
in actual use. It had been previously observed and discussed by G. K. 
Zipf,® who has made interesting suggestions regarding the underlying 
causes of the phenomenon. It is to be noted that relations of this kind 
are observed in a great variety of seemingly unrelated phenomena in- 
cluding, among others, the frequency distribution of scientific pro- 
ductivity and the relation between rank and population of the cities of 
the United States and of certain other countries. 

The population of words, like any population, grows by new acces- 
sions in excess of eliminations. The new accessions, in modern times, 
nave largely been names for new objects created by man’s ingenuity. 
Curiously enough, the same agency has caused the virtual disappear- 
ance of words. How many of the younger generation know what is a 
pair of snuffers? Some ingenious person hit on the plan of so braiding 
the candlewick that it curled as it burned, its tip projecting into the 
oxidizing zone of the flame, to be there consumed as it goes, instead of 
having to be trimmed with a special kind of scissors. And so, while 
snuffers may be found occasionally in an antique shop or a museum, 
the phrase “pair of snuffers” is today little more than a dictionary 
entry. Words can also be lost simply by being found superfluous. The 
word “shall” is thus in process of becoming extinct in the Anglo- 
American language. 

The ideal dictionary would not merely list words with their meaning, 
but would give statistical data regarding their relative frequency in 
spoken and in written language." Successive editions, published at 
considerable intervals, would give a quantitative record of the history 
of the language. This would have been of particular interest for the 


* B. F. Skinner, “A Quantitative Estimate of Certain Types of Sound Patterning in Poetry,” 
American Journal of Psychology, 1941, p. 64; S. W. Cook and B. F. Skinner, “Some Factors Influencing 
the Distribution of Associated Words,” Psychological Record, 1939, p. 178. 

°G. K. Zipf, Selected Studies of the Principle of Relative Frequency in Language, Harvard Press, 
1932; The Psychobiology of Language, Houghton Mitt!in, 1935, second edition, 1939; National Unity and 
Disunity, Principia Press, 1941, pp. 398-399. 

10 A. J. Lotka, Science, 1939, Vol. 94, p. 164; Journal Washington Academy of Science, 1926, Vol. 16, 
p. 317; Elements of Physical Biology, 1925, p. 306. 

1 This is not altogether a dream of the future. Helen Eaton, loc. cit., p. xiii, quotes F. W. Kaeding, 
“Hiaufigkeitswérterbuch der deutschen Sprache,” containing 79,716 words divided into 11 categories. 
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period of rapid transformation of Latin into French and the other 
Romance languages. In comparison with this and corresponding transi- 
tions in other tongues, modern speech is relatively at a standstill, 
anchored by the weight of the printing press. 

Studies of thought expressed in speech or writing can remain on the 
purely descriptive level, or on the lowest rung of statistical technique, 
classification and enumeration. They may assume a literary aspect, 
as in the writings of some of our modern or no longer quite modern 
writers, who have sought to fix on the canvas of literature something 
of the ill-assorted stream of consciousness. 

But thoughts are not independent events, to speak in the language 
of the statistician; or, in the language of the physiological psychologist, 
they are conditioned reflexes. In the evolution of human culture, order 
has been established at least in some realms of thought. The condition- 
ing by which one thought leads to another has become systematized. 
Fundamental to this scheme is the principle that competent thought 
must be free from contradictions. At the same time certain realms of 
thought have acquired the character of an organic whole, with results 
of the greatest practical importance, but sometimes perhaps with a 
deceptive appearance of being all-inclusive. Scrutiny reveals the fact 
that the reason for the coherence of this type of systems of thought is 
that they involve a circle which is by no means vicious: Contradictions 
cannot arise in a system which defines a cello as a large violin, and a 
violin as a small cello—to use an illustration cited, if my memory serves, 
by the late J. W. N. Sullivan. If the operations in such a scheme lead to 
inevitably consistent results, and in that sense to truisms, this does not 
by any means deprive the scheme of practical utility, as every physicist 
knows. Whether all competent thought can be united in one coherent 
whole, or in two or more distinct, but each separately coherent parts, 
remains today an open question. At any rate, it seems that the best 
minds come nearest to achieving such coherence; in such minds, as has 
been pointed out by Professor C. J. Keyser,!? thoughts seem to ap- 
proach the structure which the mathematician speaks of as the “group 
property.” 

As to the laws of thought which bring about this pleasing harmony, 
for a long time they had been supposed to have absolute precision, and 
so to be outside the realm of statistics. More recently the sanctity of 
these supposedly absolute laws has been questioned. Systems have been 
proposed in which the sharp lines of classical logic give way to the 
blurred margins of probabilities. As yet the merits of these tentatives 


2 C, J. Keyser, Mathematical Philosophy, Dutton, 1922, p. 227. 
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are a matter of dispute. This is a battlefield in which I find myself a 
non-combatant in no-man’s land, where any overt action on my part 
would expose me to crossfire fore and aft. 

Recorded thought is preponderantly thought of the past. But there 
is other thought—often unexpressed, and for that very reason of par- 
ticular importance: thought of the future, purposive thought. In this 
the statistician may well be said to have a special interest. For the ulti- 
mate use of all statistics, except the “Believe it or not” type, is predic- 
tion. 

Now prediction is of two kinds. On the basis of past observations 
the astronomer predicts eclipses of the sun and the moon, the position 
of the planets, and so forth. He is a sort of elongated dachshund, whose 
nose sniffs tomorrow, while his tail wags yesterday. 

But there is another, totally different kind of prediction. Each of us 
has his own private field of prediction. I can predict events that no 
one else can predict. I can predict that when I have completed this 
sentence I shall get up from my chair and walk around the room. No 
one else could have predicted that. This private field of prediction con- 
sists of the events which we purpose, the events which we successfully 
will. The English language has embedded this fact in its very structure, 
by appropriating the vocable ‘ will” as the auxiliary verb to character- 
ize tense. But note the significant fact that this auxiliary has a com- 
panion of somewhat different complexion: the word “shall,” which in 
some contexts implies a conflict of wills, the imposition of one person’s 
will upon the future actions of another. 

Now, in this particular part of the universe of thought also—pur- 
posive thought—there is much that is trivial, much that is chaotic: 
Purposes and cross-purposes, will contending with will to null effect, 
where harmony might have produced important collective resultants. 
For here also organization is possible. Some kind of order can be 
brought into the assortment of purposes that motivate human actions. 
Such ordering influences have indeed been exerted over a wide field, 
ranging from the sublime teaching of great ethical leaders, through the 
relatively innocuous drivel of radio sales talk, down to the diabolical 
propaganda of a herd of gangsters, and their indoctrination of plastic 
youth with calculated deviltry. A certain, perhaps mistaken delicacy, 
has allowed little play to statistical investigation of the degree of suc- 
cess of ethical teaching. Possibly a healthy, although humiliating les- 
son might be learned from such an investigation. No delicacy stands in 
the way of investigating the effectiveness of advertising as influencing 
consumer demand, and studies along that line are today a common- 
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place—though one wonders how so much piffling advertising has sur- 
vived the scrutiny. 

As for propaganda, though experimental studies have been made un- 
der “laboratory conditions,” propaganda in the wild state of nature, 
and in its most pernicious form, is inherently difficult of access to 
statistical examination, for, to quote C. H. Hoffer, “Propaganda is 
the process of influencing attitudes in a veiled manner, regarding mat- 
ters of controversy within a society. Since it is veiled and partisan in 
nature, propaganda retards the formation of intelligent public opinion, 
and thus is a menace to democracy.” The very obliquity and secretive 
character of propaganda tend to withdraw it from examination in the 
honest light of day. 

The indoctrination of youth is a somewhat debatable subject, though 
the horrible examples of Naziism and Fascism and other “isms” cer- 
tainly give us pause. Some may agree with me and some may not, if I 
say that the function of schools and colleges, aside from cultivating an 
appreciation of the humanities, is to teach facts and methods, thought 
processes, and the faculty of forming independent opinions; but that it 
is none of the functions of such institutions to inculcate ready-made 
opinions. Such indoctrination is abhorrent both to the spirit of scien- 
tific truth and to all sense of rational and honest citizenship. 

But what has this to do with statistics? The answer is obvious: The 
whole mechanism of democratic government is statistical. The election 
of our governing bodies is a statistical experiment on a national scale. 
As to its results, the experiment is not always an unqualified success. 
We have not reached perfection. A variety of unofficial polls which 
have come into vogue perhaps point a direction in which lies improve- 
ment. Certainly, as a gauge of popular sentiment of the nation’s 
thoughts and purposes, these polls are sometimes more representative 
than the actions of the elected representatives of the people. That 
such polls are by no means infallible, and indeed are not wholly free 
from danger of abuse, must be admitted. But abuse does not disqualify 
competent use. The unofficial inventories of the nation’s mind may 
contain the germ of important developments. 

I have remarked that statistics of non-material things are by no 
means immaterial. This may be one of those propositions that need 
only to be stated to be at once accepted as true. Yet I wonder whether 
it is generally realized how greatly, and on what scale, the non-material 
universe of thought, and especially of purposive thought, has affected 
the course of material, physical happenings. Actually it has been noth- 
4 C. H. Hoffer, “A Sociological Analysis of Propaganda,” Social Forces, 1942, p. 445. 
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ing less than a geological agency, a phenomenon of planetary dimen- 
sions. This is a topic that could be expanded at length, but here it must 
suffice to give one or two illustrations. It has been estimated" that the 
amount of carbon dioxide annually thrown into the air by the combus- 
tion of coal by man is about the same as the total carbon dioxide pro- 
duced annually in the respiration of the earth’s total animal population, 
and is of the order of one three-hundredth of the total carbon dioxide 
in the air. It represents a quantity of carbon which surely must have 
taken hundreds of thousands of years to accumulate as coal in the 
earth’s crust. 

The case of nitrogen is even more remarkable. For many years the 
increasing agricultural needs of the rapidly growing human population 
had largely been met by consuming supplies of nitrates accumulated 
through the ages in the saltpeter beds of Chile. As the possible exhaus- 
tion of this supply began to come into view, less than 70 years after it 
had been opened to exploitation (1831), men turned their thoughts on 
this problem, and human ingenuity unlocked the practically inexhaus- 
tible supply of atmospheric nitrogen. By 1935 about 75 per cent of the 
total world’s production of bound nitrogen was obtained from this 
source.!® 

But it is not only, and perhaps not primarily, for the purposes of the 
peaceful conduct of life, that human ingenuity had been applied. Cer- 
tainly a powerful stimulus to the early development of the nitrogen 
fixation industry was the desire in certain quarters to provide for the 
needs of war, to acquire a supply of raw materials for explosives, inde- 
pendent of the Chilean nitre beds. 

It is here that the contrast is seen between the highly effective organ- 
ization of reflective thought, logic, mathematics, theory of probability 
and their applications, on the one hand, and, on the other, the woefully 
incompetent organization of purposive thought. The fact is, that the 
human species as it stands today is sick. It suffers from a complaint 
which is more easily diagnosed than cured, because it is congenital. 

I say that the human species is sick. It is sick in the sense that any 
species afflicted with an assortment of propensities that impedes it in 
the struggle for survival—that perhaps threatens its survival—can cer- 
tainly be said to be sick. If we want a name for this sickness we might 
call it pernicious orthogenesis. 

Now orthogenesis is one of those big words which we prefer to use 
sparingly. I have borrowed it from biology, and even there it is not in 


“4 A. Bethe et al., Handbuch der normalen und pathologischen Physiologie, 1927, Vol. 1, pp. 702, 721. 
EK. R. Riegel, Industrial Chemistry, 1937, pp. 111, 112. 











12 AMERICAN STATISTICAL ASSOCIATION: 


very good repute. But for my present purpose it is a useful word, though 
here, addressed to a mixed audience, it may call for a little explanation. 
Some biologists have thought that the evolution of certain species may 
have proceeded, not by a mere preferred survival of more advantageous 
departures from the mean, but by an inherent tendency of certain spe- 
cies, quite aside from external influences, to drift in a certain direction. 
By a sort of inherent biological proclivity, certain species seem to have 
been predestined to success, or foredoomed to failure. So, for example, 
the giant saurians of Jurassic time perhaps became the victims of their 
fatal propensity for growing bigger and bigger." 

Whether such orthogenesis by physiological processes or anatomical 
features actually has taken place is a matter of dispute among biolo- 
gists. In this dispute we need not take part, because the existence of the 
kind of orthogenesis with which we are here concerned will not be dis- 
puted. 

The fundamental characteristic of orthogenesis is that it proceeds 
in a set direction, regardless of the utility or the fatality of that direc- 
tion. For that very reason orthogenesis becomes a powerful factor in 
evolution, because it can proceed much more rapidly than the gradual 
accumulation of variations or mutations from generation to generation. 
Indeed, it can proceed at a rate which, on a geological scale, is nothing 
short of explosive suddenness. 

This is precisely what has happened to the human species in the 
past few centuries, after a period of preparation extended over perhaps 
an equal number of millennia. 

Curiosity, that ingrained trait of human nature, is the mainspring of 
this type of orthogenesis. In these days it is rather out of fashion to 
extoll the virtues of pure science, or even to attempt any distinction 
between pure and applied science. But it remains a fact of superlative 
importance that men have questioned Nature from simple curiosity, 
and have wormed out many of her secrets, with no conscious intent to 
make so-called useful application of the knowledge gained—or indeed 
without suspecting that it might bring catastrophe crashing down upon 
the peoples of the earth. These applications, favorable or unfavorable 
to the species, have come after the scientific discovery—and not infre- 
quently a long time after, figuring in the ordinary human time scale. 
Indeed, some of these applications were not possible until considerable 
progress had been made in the unprejudiced interrogation of Nature. 
This is the typical characteristic of orthogenesis—continued movement 
in a set direction, independently of the utility or harm of the result, 
until a stage is reached where selection has something tangible to lay 


A. J. Lotka, Physical Biology, 1925, pp. 296, 3783 
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hold on, and to operate either for greater survival, or, it may be, for 
destruction. 

This then is the sickness with which the human species is sick: 

By a process of orthogenesis, that is, through the accumulation of 
knowledge and skills from generation to generation, man has developed 
marvelous powers for the individual to achieve his purposes. But he 
has failed, and failed very badly, to organize these purposes for their 
survival value. Man’s evolution, in this respect, has been lopsided, and 
there is evidence in the rocks that such lopsided development is a 
danger signal for the species. 

In these times of grave historical events, even an informal address 
from your retiring president cannot fail to be projected against the 
background of the weighty issues that are occupying the minds of all 
responsible men and women. The winning of the war is necessarily the 
task of our military leaders and the forces bravely carrying out their 
strategies. The winning of the peace is the concern of us ali, but more 
especially of those who will give orientation to purposive thought. 

It is an unfortunate fact that to guide men into competent purposive 
courses is far more difficult than to teach them the use of ingenious 
contrivances for attaining such ends as they may set themselves. We 
give a man an automobile and show him bow to turn a switch, shift a 
lever, step on the gas, and away he goes. He needs no persuasion that 
the scheme works—the demonstration would convince even that leg- 
endary skeptic who, on seeing a giraffe, remarked “there ain’t no such 
animal.” 

But in guiding men’s purposes we have no such compelling demon- 
strations to present as a means of persuasion. Added to the difficult 
problem of devising efficient policies, there is the still more difficult one 
of overcoming resistance to their adoption. This resistance is twofold. 
The people must first be persuaded of the need and the reasonable ef- 
fectiveness of the proposed policies; second, what is even more difficult, 
they must be induced to sacrifice certain personal advantages in the 
pursuit of purposes transcending individual interests. 

And there is another important and fundamental difference between 
the influence of organized reflective thought in determining group or 
collective activity of human society and, on the other hand, the role of 
organized purposive thought in directing group behavior: It does not 
require any very high intelligence for the individual of the rank and 
file to profit by the accomplishments of our technical experts and 
scientific geniuses. In this connection, a series of photographs repro- 
duced in the newspapers the other day told an instructive fable. It 
showed a genial chimpanzee going through the orthodox motions of 
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lighting a cigar and proceeding thoroughly to enjoy a good smoke. But 
it was a little artistic detail that caught my particular attention: This 
chimp, we are informed, always begins by carefully stripping the brand 
label from the cigar. He does not, of course, know why he does this ex- 
cept that he has seen it done. I am afraid that many of our actions have 
not a very much sounder basis. And so long as we are dealing with 
what I might call the mechanics of everyday life, this very diluted 
participation in the management of the world works satisfactorily 
enough. We can leave the thinking to others, and just push buttons, 
relying on the mysterious mechanism behind them to work according to 
schedule. 

But this sort of thing is quite inadequate for the direction of collec- 
tive human activity by and for organized purposive thought. In a 
democracy, at any rate, the individual must personally and under- 
standingly participate in the management of affairs. This makes high 
demands on the general level of intelligence, wisdom, and enlightened 
patriotism under our form of government. 

And here let us do our thinking as statisticians, and above all as 
realists. Evil men and groups of such there will presumably always be 
among the earth’s population. No scheme which fails squarely to face 
this fact can receive our serious consideration. And the danger probably 
can never be eliminated that, from time to time, an evil genius may give 
false and wicked orientation to the purposive thought of multitudes. 

I say let us do our thinking as statisticians, for the problem with 
which we have to deal is fundamentally a problem of numbers. 

There are certain established, and on the whole very effective, meth- 
ods of dealing with evil men on a civil scale. There are punishments as 
deterrents, and there are ways of restraining and constraining the un- 
ruly, if need be with capital finality. But fundamental in these meth- 
ods is a numerical situation: The law-abiding and law-enforcing 
element, in civil life, in national life, immensely outnumbers the 
miscreants. 

Unfortunately, when we seek to translate into international pro- 
cedures the principles and methods of civil law enforcement, we are 
met at the very start with a complete lack of parallelism. Nations, un- 
like individuals, are not counted in millions, not even in hundreds; and 
those of dominating power, not even in tens. This completely upsets 
the parallel. 

When an issue arises between nations, it is not a case of a handful of 
men against millions, but at best of one nation against a very limited 
number of others—four or five perhaps if the contention is among the 
world’s great powers. What is more, the extreme contrast of right and 
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wrong in the present contest must not obscure to our sight the fact 
that ordinarily issues are not thus drawn with indisputable clarity, and 
if sides are taken, say, by five powers or their representatives, as to the 
merits in a dispute, the split may not be four to one, but three to two, 
in favor of one or the other contention. Such a split as this will in no- 
wise guarantee a decision without the use of force, or a wise or good 
decision, with or without the use of force. And if the use of force should 
be enjoined by some policy previously agreed upon, to settle such a 
dispute, there is still no guarantee that parties only indirectly interested 
will carry out their mandate. Past experience does not encourage con- 
fidence in the reliability of such undertakings. And, most important of 
all, in addition to these negative failings that may be inherent in a 
plan devised to maintain peace, it may have the very serious positive 
defect of giving rise to a false sense of security. Here also past experi- 
ence is not encouraging. Where there are brigands it is still the part of 
a wise traveler to carry arms. 

Does this mean, then, that we are to regard as entirely hopeless all 
attempts to organize men’s purposes to transcend personal interests, to 
work for greater and better harmony? 

If, as statisticians, we have recognized the problem as being, in one 
respect at least, a problem of numbers, does this mean that the cards 
are stacked against us in our desire to establish and preserve a peace 
with honor and decency? 

No, we shall not consider the case hopeless. Between an attitude of 
infantile optimism, at one extreme, and giving way to senile defeatism 
at the other, there surely is a middle ground of adult realism, which 
does not dwell in a world of illusions, but discerns difficulties for what 
they are. 

When we labor to win the war, our aim is single and clear; the prob- 
lem is only one of the means to achieve this clear purpose. But to speak 
of winning the peace is to telescope into one phrase what is really a 
discouragingly complex problem, even in merely focusing the target 
at which to aim. 

The effort to win the war will be brought to a final climax in due time. 
But the peace can be secured and preserved only by lasting measures 
pursued unrelentingly through all time. 

Among such measures may I in concluding suggest only one: A 
world cataclysm has welded together for the time being the United 
Nations. Let us work to preserve this unit. The parable of the mote and 
the beam applies to nations as it does to individuals. Let us bury minor 
criticisms of our Allies and friends in garlands of praise for the deathless 
glory of their magnificent stand against our common foe. 











INTERNAL MIGRATION AND THE WAR* 


By Henry S. Suryrock, Jr.t 
Bureau of the Census 


URING THE past few years, workers and their families have migrated 
D in such large numbers to centers of war production that they have 
created serious problems of housing, transportation, health, and educa- 
tion. These community stresses and strains have focused the public’s 
attention on the population growth in many areas. There has been no 
systematic quantitative description, however, of the volume of in- 
ternal migration or of the net gains made by the various areas of the 
country. Still less has the volume of recent migration been compared 
with the volumes of migrations in other periods of our history. The 
opinion has been expressed! that the extent of recent movements has 
been exaggerated in many of the popular stories in newspapers and 
magazines, since it takes a relatively small number of migrants to 
create numerous problems in a community, particularly an already 
congested one. There have been many attempts to attribute to popula- 
tion growth or in-migration, changes that may be due partly, or even 
solely, to fuller employment and greater purchasing power. Crowding 
on public vehicles; increased demand for gas, electricity, and tele- 
phones; larger payrolls or sales; and fewer housing vacancies have all 
been loosely interpreted as evidence of migration into an area. A better 
understanding of the problems before us should be afforded by isolation 
of internal migration, as one of the components. 

The present consideration of internal migration is limited by the 
available data mainly to net migration for States and an approximation 
to net migration for counties. It would be extremely interesting to 
examine gross migration also, but pertinent statistics are exceedingly 
scanty. For other reasons, immigration and emigration to and from 
continental United States are not included. Movements of the latter 
type have by no means disappeared with the war, but they are relatively 
unimportant. 

Estimates of net migration during the period April 1, 1940, to May 1, 
1942, represent the difference between the estimated actual population 
on May 1, 1942, and the population estimated for that date on the as- 


* This paper was prepared for presentation at the 104th Annual Meeting of the American Statistical 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 

t For part of the discussion of metropolitan counties, the writer has drawn on an address read by 
Dr. Philip M. Hauser, Assistant Director of the Bureau of the Census, to the Wartime Marketing Con- 
ference of the American Management Association on January 15, 1943, in Chicago. He is also indebted 
to Mr. Robert J. Milliken of the Population Division of the Census Bureau, who supervised the cal- 
culations and gave valuable suggestions on methodology and interpretation. 

1 Myers, Howard B., “Defense Migration and Labor Supply,” this Journat, Vol. 37, p. 72, March, 
1942, 
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sumption of no net movement. The estimates of actual population are 
those based mainly on the number of registrations for War Ration Book 
Number One (Sugar) and published in two recent releases of the Census 
Bureau.? The method of their preparation, together with brief com- 
ments on the limitations of the data, is described in these releases. A 
much more complete description of methodology and possible biases 
will be given in a forthcoming article by Norman Lawrence.’ The popu- 
lation estimates omit persons in the armed forces, and the migration 
estimates are restricted to net interstate movements of civilians. 

Obviously, these estimates of internal migration are affected by the 
possible biases elsewhere described in the rationing registration data 
as representative of the noninstitutional civilian population in an area. 
It is felt, however, that these shortcomings do not invalidate the general 
picture of internal migration here presented. 

The other half of the comparison—the “expected” State civilian pop- 
ulations—was estimated by a variation of the standard method. From 
the population of a State at the census date, the births and deaths 
(both corrected for underregistration) through April, 1942, were added 
and subtracted, respectively. An estimate of the State’s share of the 
net immigration from outside continental United States was also added. 
Finally, an allowance was made for the number of persons who had 
entered the armed forces from each State. The resultant estimate of 
expected civilian population subtracted from the corresponding actual 
population should represent the postcensal net civilian in-migration 
from or out-migration to other States. The algebraic sum of these 
figures for all States is zero. 

For the United States as a whole, inductions and enlistments ex- 
ceeded the natural increase combined with the small net civilian im- 
migration from abroad, so that there was a slight loss in civilian popula- 
tion during these two years. The net migration figures presented in the 
table are roughly similar to the changes in civilian population. Where- 
ever the estimated net migration is algebraically greater than the esti- 
mated population change, then the loss of men to the armed forces 
must have exceeded natural increase, and vice versa. Thus the high 
fertility States in the South show more increase in civilian population 
than in-migration or less decrease in population than out-migration. 

Nineteen States and the District of Columbia had net in-migration 
between 1940 and 1942; 29 States had net out-migration. The geo- 


2 U.S. Bureau of the Census. Sixteenth Decennial Census of the United States. “Preliminary Estimates 
of the Civilian Population by States: May 1, 1942,” Series P-3, No. 30, October 27, 1942; and “lrelim- 
inary Estimates of Civilian Population in Selected Metropolitan Counties: May 1, 1942,” Series P-3, 
No. 31, December 1, 1942. 

+ Entitled “War Registration as a Source of Population Estimates.” 
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graphic pattern may be seen from a glance at Chart I. In New England 
the rural, northern States lost civilian population by migration, and 
the urban, southern ones gained. Among the Middle Atlantic States, 
only New Jersey had a net in-migration. All of the East North Central 
States gained through migration, whereas all of the relatively more 
rural West North Central States lost. Losses were particularly severe 
in the tier of Great Plains States extending from North Dakota through 
Oklahoma. In the South, net in-migration was experienced by Dela- 
ware, Maryland, the District of Columbia, Virginia, Florida, Alabama, 
Louisiana, and Texas. These are all coastal States, and their gains were 
concentrated largely in their coastal metropolitan areas. All the Moun- 
tain States except Nevada suffered net out-migration. On the Pacific 
Coast, during the period under consideration, net gains occurred in 
California and Washington. 

The highest percentage gain from net in-migration occurred in the 
Pacific Division, where there was a net gain amounting to 4.5 per cent 
of the 1940 civilian population. This was followed by the East North 
Central Division with 2.4 per cent. The New England and South At- 
lantic Divisions had net in-migrations of one per cent or less. The 
largest percentage of net out-migration (3.3 per cent) was suffered by 
the West North Central Division. Almost as high a net out-migration 
percentage was found in the Mountain Division, which was followed 
by the East South Central, Middle Atlantic, and West South Central 
Divisions, in that order. The largest number of net in-migrants— 
638,000 persons—was received by the East North Central Division; 
and the largest number of net out-migrants—448,000—left the West 
North Central Division. 

California led the States in total number of net in-migrants with 
395,000, and 4 States (Michigan, Illinois, Virginia, and New Jersey) 
and the District of Columbia had between 100,000 and 300,000. On 
the other hand, New York lost 421,000 through net out-migration, and 
4 States (Oklahoma, North Carolina, Kentucky, and Minnesota) lost 
between 100,000 and 200,000. The District of Columbia’s net migration 
was almost 25 per cent and far surpassed the percentage gain of any 
of the States. Only four of these (Nevada, California, Michigan, and 
Maryland) registered gains of five per cent or more. Material to be 
presented for counties will show how unevenly these migratory gains 
were distributed within the States having large net in-migration. 

It should be emphasized before proceeding, however, that the figures 
considered above deal only with the balance of in-migration and out- 
migration for each State. The gross interstate migration was un- 
doubtedly far larger. For example, according to provisional figures from 
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1940 to 1942 per year migration rate 
Region, Division, Per 
and State cent 1940 1930 1920 1940 | 1930 | 1920 
Number | of 1940 to to to to to to 
popu- 1942 1940 1930* 1942 | 1940 | 1930* 
lation 
Regions: 
The North — 122,000 —0.2 —58,700 | —49,600 —4,300 | —0.1 | —0.1 — 
The South — 182,000 —0.4 —87,300 | —85,800 | —133,400 | —0.2 | —0.2 | —0.4 
The West 304,000 2.2 146,000 135,400 137,700 1.1 1.1 1.3 
The North: 
New England 80,000 1.0 38,400 —2,000 —31,400 0.5 = —0.4 
Middle Atlantic —392,000 —1.4 — 188,200 21,900 47,100 | —0.7 0.1 0.2 
East North Central 638,000 2.4 306,200 —1,800 67,000 1.1 —- 0.3 
West North Central | —448,000 | —3.3 | —215,100 | —67,700 —87,000 | —1.6 | —0.5 | —0.7 
The South: 
South Atlantic 107,000 0.6 51,400 10,400 —65,400 0.3 0.1 | —0.4 
East South Central | —197,000 | —1.8 —94,600 | —42,600 —58,600 | —0.9| —0.4/| —0.6 
West South Central —92,000 | —0.7 —44,100 | —53,600 —9,400 | —0.3 |} —0.4| —0.1 
The West: 
Mountain — 132,000 —3.2 —63,300 | 5,100 —23,900 | —1.6 0.1 | —0.7 
Pacific 436,000 4.5 209,300 130,300 161,600 2.1 1.5 2.3 
New England: 
Maine —16,000 | —1.9 —7,700 —100 —5,700 | —0.9 —_— —0.7 
New Hampshire —7,000 | —1.4 —3,400 1,300 —2,500 | —0.7 0.3 | —0.5 
Vermont —12, —3.4 —5,800 —1,900 —2,900 | —1.7 | —0.5 —0.8 
Massachusetts 21,000 0.5 10,100 —6,700 —18,200 0.2 | —0.2 | —0.4 
Rhode Island 22,000 3.1 10,600 100 —2,000 15/ — | -—03 
Connecticut 72,000 4.2 34,600 5,300 —100 20; O03; — 
Middle Atlantic: | 
New York —421,000 -—3.1 — 202,100 49,200 | 52,200 | —1.5| 04 0.5 
New Jersey. 126,000 3.0 60,500 300 29,000 1.4 -- 0.8 
Pennsylvania 97,000 | —1.0 —46,600 | —27,600 —34,100 | —0.5| —0.3 | —0.4 
East North Central: } 
Ohio 77,000 1.1 37,000 | —4,500 13,900 0.5! —0.1 |; 0.2 
Indiana 76,000 2.2 36,500 2,400 300 Bul 01; — 
Illinois 188,000 2.4 90,200 — 4,600 21,400 11}; -0.1) 03 
Michigan 282,000 5.4 135,300 | 4,600 40,700 2.5 0.1 0.9 
Wisconsin 15,000 0.5 7,200 300 —9,300 0.2 =) —0.3 
West North Central: } 
Minnesota —102,000 |} —3.7 | —49,000|} 3,500 | —19,900} —1.8/ 0.1} -—08 
Iowa —89,000 —3.5 —42,700 —8,300 —20,500 | —1.7 | —0.3 | —08 
Missouri —31,000 | —0.8 —14,900 —6,100 —9,400 | —0.4/| —0.2 | —03 
North Dakota —52,000 —8.1 —25,000 | —11,400 —9,500 | —4.1 | —1.7 | —1.5 
South Dakota —55,000 | —8.6 —26,400 | —11,300 —5,700 | —4.3 | —1.7 |} —0.8 
Nebraska —61,000 | —4.6 —29,300 | —15,800 —10,700 | —2.3 | —1.2| —0.8 
Kansas — 58,000 —3.2 —27,800 | —18,300 —11,300 | —1.6 | —1.0| —0.6 
South Atlantic: 
Delaware 12,000 4.5 5,800 1,800 —2,200 2.1; 0.7) —1.0 
Maryland 93,000 5.1 44,600 9,500 —800 2.4 0.5 | —0.1 
District of Columbia 164,000 24.9 78,700 16,300 2,800 10.6 2.7; 06 
Virginia 130,000} 4.9 62.400 —900 | —23,600} 23| — | -10 
West Virginia —60,000 | —3.2 — 28,800 —9,800 —6,400 | —1.5| —0.5| —0.4 
North Carolina —168,000 | —4.7 —80,600 | —10,000 —1,000 | —2.3/; -03| — 
South Carolina —43,000 | —2.3 —20,600 | —12,700 —25,900 | —1.1 | —0.7 | —1.5 
Georgia — 85,000 —2.7 —40,800 | —14,800 —41,900 | —1.3 | —0.5| —1.5 
Florida 64,000 3.4 30,700 31,000 33,600 1.6 1.9 2.7 
East South Central: | 
Kentucky —108,000 | —3.8 —51,900 | —11,600 —20,800 | —1.9| —0.4/ —0.8 
Tennessee —25,000 | —0.9 —12,000 —3,100 —12,000 | —0.4/} —0.1 | —0.5 
Alabama 25,000 0.9 12,000 | —19,800 — 15,300 0.4 | —0.7 | —0.6 
Mississippi —89,000 | —4.1 —42,700 —8,100 | -—10,500 | —2.0| —0.4| —0.5 
West South Central: | 
Arkansas —17,000 | —0.9 —8,200 | —17,300 —21,600 | —0.4 —0.9 | —1.2 
Louisiana 46,000 2.0 22,100 1,7 —3,000 0.9 0.1; —0.2 
Oklahoma —168,000 | —7.2 —80,600 | —32,400 —5,600 | —3.6 | —1.4)| -—0.3 
Texas ,000 0.7 22,600 —5,600 20, | 04; -0.1; O84 
Mountain: | | 
Montana —31,000 —5.6 — 14,900 —1,300 —8,800 | —2.8| —0.2 | —1.6 
Idaho —50,000 | —9.5 | —24,000 3,400 —5,800 | —4.8| 0.7] —1.3 
Wyoming — 13,000 —5.4 —6,200 400 —600 | —2.6/} 02| —03 
Colorado —28,000 | —2.5 —13,400 1,500 —3,700 | —1.2 0.1 | —0.4 
New Mexico —15,000 | —2.8 —7,200 1,500 —2,500| -14; 03] —0.6 
Arizona —10,000 | —2.0 —4,800 500 1,300 | —1.0 0.1 0.3 
Utah —6,000 —1.1 —2,900 —2,400 | —4,000 | —0.5 | —0.5| —0.8 
P svete 21,000 19.1 10,100 1,500 200 8.5; 1.5 0.2 
acific: 
Washington 49,000 2.9 23,500 12,800 2,800 1.4 0.8 0.2 
Oregon —8,000 | —0.7 —3,800 11,200 7,200 | —0.4 1.1 0.8 
California 395,000 5.7 189,600 | 106,300 151,600 2.7 1.7 3.2 
_ * Adapted from Thornthwaite, C. Warren, Internal migration in the United States (Study of Popu- 
lation Redistributiou), Philadel hia University of Pennsylvania Press, 1934, Map facing p. 22. Ex- 
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the 1940 Census, Georgia, which has been experiencing net out-migra- 
tion for years, attracted over 150,000 in-migrants, 1935 to 1940. 

The Bureau of Agricultural Economics estimates‘ that in 1941, when 
the net migration was 1,357,000 from farms, 2,171,000 persons moved 
from farms and 814,000 persons moved to farms. 

Furthermore, more persons moved within States than moved from 
one State to another. According to the WPA surveys described by 
Myers,® the average move of workers to most defense centers during 
the pre-war phase of this period was less than 125 miles. Between 1935 
and 1940, moreover, provisional census data indicate that in most of 
the Southern States the number of migrants from one county to an- 
other within the State exceeded the number from other States. 

The effect of the defense and war programs on internal migration 
can best be evaluated by a comparison with internal migration during 
earlier periods. Unfortunately, no comparable figures are readily avail- 
able for the last war or even for the entire decade 1910 to 1920. Esti- 
mates of net interstate migration from 1920 to 1930 were computed 
by Thornthwaite® on the basis of census age data. Similar estimates 
have been worked out for the decade from 1930 to 1940. (See table.) 
These net migration figures are for the total rather than the civilian 
population, but the armed forces in continental United States during 
these periods were negligible. All figures are for movements within 
continental United States, however. In order to allow for the dif- 
ferent lengths of the periods, comparisons are made on the basis of 
the average number of net migrants per year. In the computation of 
annual net migration rates, the population at the mid-point of the 
period was used as the base. 

It must be recognized that there was not a uniform annual flow of 
migrants during these periods; sometimes even the direction of net 
migration may have shifted from year to year. At the beginning of the 
’twenties, adjustments were still being made to the population shifts 
effected by World War I; there was a severe but relatively brief depres- 
sion followed by a long period of prosperity. The second of these decades 
contained years of severe depression and partial recovery. 

Since there is more opportunity for the flow of migration to be re- 
versed in a 10-year than in a 2-year period, average yearly net migration 
tends to be smaller in the longer periods. By and large, however, the 


4 Bureau of Agricultural Economics, “Farm Population Estimates: 1910-1942.” Washington, No- 
vember, 1942, p. 2. 

5 Myers, op. cit., p. 74. 

* Thornthwaite, C. Warren, Internal Migration in the United States (Study of Population Redis- 
tribution) Philadelphia, University of Pennsylvania Press, 1934, map facing page 22. Immigrants from 
abroad have been removed. These estimates, unlike those for the other periods, do not include move- 
ments of children born during the period. 
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direction of net migration for a State was probably fairly consistent 
within periods. It is pertinent to point out in this connection that, 
according to the Bureau of Agricultural Economics, there has been a 
net migration to farms only once (1932) since 1920 despite the sharp 
economic fluctuations in these years. 

The Pacific Division was the only one that had a net in-migration 
during the postcensal years and in both of the preceding decades. The 
West North Central and East South Central Divisions had net out- 
migrations in all three periods, as did the West South Central except 
that in-migration almost balanced out-migration during the ’twenties. 
Net movements in the other divisions have not been so consistent. 

Among the States, the balance of migration has consistently favored 
the District of Columbia, Florida, California, and, to a lesser degree, 
Nevada and Washington. The depression of the ’thirties interrupted 
the flow into industrial New Jersey, Ohio, Illinois, and Michigan. Many 
States had net out-migration in all three periods: Vermont, Pennsyl- 
vania, Iowa, Missouri, North and South Dakota, Nebraska, Kansas, 
West Virginia, South Carolina, Georgia, Kentucky, Mississippi, Arkan- 
sas, Oklahoma, Montana, and Utah. The war brought net in-migration 
for the first time in Alabama, whereas New York had not previously 
experienced a net out-migration. 

A comparison of the yearly rates of net migration among the three 
periods (disregarding signs) shows that in most divisions and States 
net migration was larger in the war period than in the two preceding 
decades.? This was probably true also of gross internal migration. It is 
likely that migration from the Great Plains was even heavier than 
during the severe drought years of the ’thirties. The in-migration for 
Florida, however, has been declining. 

An examination of internal migration in terms of States alone is not 
very satisfactory. Wide intrastate differentials in terms of subregions, 
urban and rural areas, and metropolitan districts and other territory, 
would certainly be expected. The Office of Price Administration in 
Washington has compiled the number of registrations for War Ration 
Book Number One by counties but not for cities or metropolitan dis- 
tricts. Estimates of civilian population on May 1, 1942, were accord- 
ingly prepared for counties. In order to obtain approximations to the 
census metropolitan districts, which are defined in terms of townships 
or other minor civil divisions, those counties with 50 per cent or more 
of their total 1940 population living in a metropolitan district were 

? The rates for the period from 1920 to 1930 would have been slightly higher if the movement of 


children under 10 years old at the end of the decade had been included, so that comparisons with this 
decade must be made with caution. Usually, however, this age group is not highly mobile. 
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designated “metropolitan counties,” and estimates were prepared for 
such counties, or groups of counties, corresponding roughly to each 
metropolitan district. 

It would be too laborious a task to compute net civilian migration 
for each county by the method used for each State. Consequently, we 
shall have to infer migratory movements from population growth. It 
may be recalled that natural increase was about the same as inductions 
into and enlistments in the armed forces for the country as a whole. If 
this were true for each area, then any change in civilian population 
could be attributed to migration. Although there are geographic varia- 
tions in age composition, sex ratio, fertility, and mortality (which would 
affect shifts of civilian population to the armed forces as well as natural 
increase), there is, for States, a high correlation between net migration 
and civilian-population change. Except for the influence of natural in- 
crease, the contrast between the growth of metropolitan counties and 
that of the rest of the country would be even greater, since natural 
increase is usually lowest in these counties.® 

Of 137 metropolitan counties or groups of counties, 88 showed a gain 
in civilian population between April 1, 1940, and May 1, 1942, 43 
showed a decline, and 6 showed no appreciable change.® For those 
counties in which the civilian population increased, the total gain 
amounted to 2,639,000, or 6.8 per cent of their 1940 civilian population. 
The net increase in the civilian population of all of the metropolitan 
counties was 1,853,000, or 2.8 per cent. 

Very large gains in civilian population have taken place in the metro- 
politan counties centering in the following cities: Detroit, a gain of 
336,000; Washington, D. C., 231,000; Chicago, 149,000; Los Angeles, 
131,000; Norfolk, Portsmouth, and Newport News, 107,000; St. Louis, 
97,000; San Diego, 97,000; and San Francisco and Oakland, 95,000. 
The largest percentage gains were recorded in the areas centering in 
San Diego; Norfolk, Portsmouth, and Newport News; Mobile; Corpus 
Christi; and Washington, D. C.—all of which had a civilian population 
gain of more than 25 per cent. The largest loss, 365,000, occurred in the 
New York-Northeastern New Jersey metropolitan counties. 

Population changes in the metropolitan counties may be examined 
for four regions: the Northeastern, the North Central, the Southern, 
and the Western. The metropolitan counties in the South showed the 


* The birth rate has risen markedly as a result of the war and its concomitant nearly full employ- 
ment, but a comparison of natural increase with population increase in a sample of rapidly growing 
metropolitan districts shows that the excess of births over deaths was a minor factor. 

*U. S. Bureau of the Census. Sixteenth Decennial Census of the United States. “Preliminary Esti- 
mates of Civilian Population in Selected Metropolitan Counties: May 1, 1942,” Series P-3, No. 31, De- 
cember 1, 1942. 
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greatest annual rate of increase in population between 1940 and 1942, 
3.9 per cent, as compared with an annual rate of increase of 2.7 per cent 
in the West, 2.0 per cent in the North Central Region, and an annual 
rate of decrease of 0.6 per cent in the Northeast. 

Moreover, in the South the small metropolitan counties (those having 
central city populations of less than 100,000 or a total population of 
less than 150,000) had a slightly higher annual rate of increase than 
did the large metropolitan counties, 4.1 per cent as compared with 3.9 
per cent. In each of the other three regions, the large metropolitan 
counties grew more rapidly than did the small. 

As might be expected, metropolitan counties with high rates of 
growth were in practically every case those with more than average 
wartime activity, whereas those with heavy rates of decline were rela- 
tively inactive and contained relatively few important plants working 
on war contracts. There was no apparent relationship, however, be- 
tween the industrial distribution of employed workers in a metropolitan 
area and its rate of growth, except that centers of shipbuilding and 
aircraft production had very large population increases. 

Again we should compare the special type of population growth oc- 
curring in the defense and war periods with the growth of metropolitan 
counties during earlier years. The average yearly rate of increase from 
1940 to 1942 was 1.3 per cent, which is significantly larger than the 
average rate of 0.8 per cent prevailing in the years from 1930 to 1940, 
which included a number of depression years. The recent rate falls 
short, however, of the yearly rate (2.7 per cent) prevailing between 
1920 and 1930, a period when urbanization was extremely rapid. It was 
only in the South that the yearly rate of increase since the last census 
was greater than the yearly rate of increase both between 1930 and 
1940 and between 1920 and 1930 (3.9 per cent as compared with 1.7 
per cent and 2.9 per cent). 

It is possible, however, that there was an even greater concentration 
of population during the defense period than during the ’twenties. Not 
only did metropolitan counties grow at the expense of other counties; 
but among metropolitan counties it has been shown that in most parts 
of the country, the larger ones grew more than the smaller. 

Since the size of the civilian population of the country as a whole 
changed very little between April, 1940, and May, 1942, and since the 
137 metropolitan counties, which contained about half the total popula- 
tion, had a gain of about 2.8 per cent, it is obvious that the remaining 
counties must have lost population at about this rate. In fact, only 
about one-fifth of these counties gained civilian population. Only 9.4 
per cent gained as much as 5 per cent, only 2.9 per cent as much as 15 
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per cent, and only 1.3 per cent as much as 25 per cent. Many of the 
counties (37.9 per cent) gained or lost less than 5 per cent of their 1940 
civilian population during these 25 months. More than half of the coun- 
ties, 52.6 per cent, declined more than 5 per cent, 20.0 per cent declined 
more than 15 per cent, and 2.4 per cent declined more than 25 per cent. 

Nonmetropolitan counties with extremely large increases in civilian 
population were mostly in the South and West, and many of them con- 
tain ordnance plants or army camps. (Civilians are attracted to the 
vicinity of army camps, first, by the need for construction workers and, 
later, by the opportunity to sell goods and services to soldiers.) Typical 
examples are Mineral and Clark Counties, Nevada, boom-centers of 
basic magnesium production, with a naval ammunition depot and an 
airfield, respectively; the independent city of Radford, Virginia, which 
contains a shell loading plant; Kitsap County, Washington, site of one 
of the country’s largest navy yards; Orange County, Texas, a ship- 
building center; and Fairfax County, Virginia, which is partly in the 
Washington, D. C., metropolitan district but was not classified as a 
metropolitan county, according to the 50-per cent rule which was used. 

In considering the net migration for States from 1940 to 1942, several 
States (Nevada, California, Michigan, Maryland, and Virginia) were 
singled out as having experienced very high rates of net in-migration. 
It is now possible to show how these gains were distributed within 
these States. In Nevada the population gain in Mineral County and in 
Clark County exceeded that in the State as a whole, indicating a loss 
in the balance of the State. Similarly, the combined gains in the Los 
Angeles, San Francisco, and San Diego metropolitan counties exceeded 
those in the State of California, although there were also sharp increases 
in a few of the small counties. In Michigan the Detroit metropolitan 
counties alone had a larger population increase than the State as a 
whole, but Kalamazoo, Muskegon, and a few other counties also shared 
in the gain. More than nine-tenths of Maryland’s population increase 
occurred in the Baltimore and Washington metropolitan counties. 
Finally, Virginia’s increase was concentrated in the Hampton Roads 
area, the Washington metropolitan counties, and in a few of the moun- 
tain counties where the increase generally was due to operation of a 
single large industrial plant. In short, even in these States a majority 
of the counties lost civilian population, largely through out-migration 
to other counties in the State. 

The situation was more consistent in States with heavy out-migra- 
tion. In the tier of States from North Dakota down through Oklahoma, 
all of which lost population, only 33 counties increased, and 4 of these 
were metropolitan counties. 











NO OO ge Se 


~— ee 








OPP 





; 
' 








- INTERNAL MIGRATION AND THE WAR 27 


The shifts of population since 1940 cannot be described simply as 
rural-to-urban migration. It has been shown that almost one-third of 
the metropolitan counties or groups of metropolitan counties lost civil- 
ian population. Many of these were in the Northeast. Counties in and 
around such large cities as New York, Pittsburgh, and Boston were in- 
cluded. It appears rather that movements were primarily to industrial 
areas active in war production or to scattered rural counties containing 
army camps or newly erected war plants. Although according to esti- 
mates by the Bureau of Agricultural Economics,’® the net migration 
from farms in 1941 (1,357,000) was the largest on record, during this 
same year the WPA migration surveys showed that few workers mi- 
grating to defense centers came from agriculture. Possibly the migrants 
from farms filled the “vacuum” left by migrants to defense centers. 

Since the characteristics of persons registering for War Ration Book 
Number One have not been compiled, the characteristics of net civilian 
migrants are not obtainable from estimates based on this source. The 
most extensive source of information on the characteristics of net mi- 
grants is represented by previously mentioned surveys conducted by 
the WPA. These were restricted to in-migrants into about 50 cities of 
25,000 or more inhabitants and covered movements during a period of 
roughly one year, beginning in October, 1940. Unfortunately for our 
purposes, some of the more pertinent characteristics, such as age and 
sex, are shown only for workers and not for all in-migrants. 

Statistics on color and size of family are among those shown for all 
in-migrants. The surveys also presented the origin of all in-migrants 
in terms of geographic area, distance, and size of place. Distributions 
presented only for migrant workers include those by age and sex, in- 
dustry and occupation before and after migration, and employment 
status in present place of residence (by sex, age, month of arrival, and 
industry and occupation at last place of residence). 

In summarizing these statistics, Myers states that in proportion to 
their number in the population, Negroes have made up only a small 
proportion of the migrants to defense centers. “In most areas fewer 
than five per cent of the migrants are Negroes, and even in the South, 
migration rates for Negroes are much lower than for whites. This is 
understandable, in view of the wide-spread discrimination against 
Negroes in war industries. It contrasts sharply with experience in the 
first World War, however, when a large scale migration of Negroes to 
Northern industrial centers took place. As the demand for labor in- 
creases and present employment restrictions are relaxed, it is probable 
that Negroes will begin to move in greater numbers.”" 


” Loc. cit. 11 Myers, op. cit., p. 74. 














AMERICAN STATISTICAL ASSOCIATION: 





28 


The migrant workers included relatively few females, again because 
of discrimination in hiring. Females were probably also underrepre- 
sented in the total number of migrants partly for this reason and partly 
because many male migrant workers have left their wives and other 
female dependents behind. In most defense centers, between 20 and 50 
per cent of lone migrants had dependents living elsewhere. It is possible, 
of course, that the breadwinner, after a period of adjustment, was joined 
by other members of his family. Housing shortages would make the 
uniting of these families very difficult, however. 

As usual, the migrant workers were characteristically young persons. 
“In nearly half of the cities the average age of all migrant workers 
is less than 30 years; in no city does the average rise as high as 35 
years.”!? The average age of all migrants must have been even younger. 

It is difficult to find comparable statistics for earlier migrations. 
The net interstate migration during the period 1930 to 1940 has been 
estimated by age, color, and sex, and a very rough comparison may be 
made on the basis of these data. It is not certain, of course, that the 
groups having the highest net migration rate also had the highest gross 
rate, but such a correspondence seems likely. During the last inter- 
censal decade, the mobility of Negroes on the basis of net interstate 
migration was much higher than that of whites. Negro females were 
more mobile than Negro males, and white males were only slightly more 
mobile than white females. These differences seem to contrast sharply 
with those found in the early period of defense activity. Migrants in the 
two periods were probably more similar in terms of age composition. 
Between 1930 and 1940, persons 25 to 29 years old had larger net in- or 
out-migration rates than persons in any other 5-year age group. Since 
the migration occurred at an unspecified time during the decade, these 
young adults could have been up to 10 years younger when they made 
the indicated move from one State to another. 

It is well to preface a summary by reminding the reader that ac- 
curacy of the net civilian interstate migrations from 1940 to 1942, 
which are the main feature of this article, depends on the accuracy of 
population estimates based on statistics of registration for sugar ration- 
ing, some of which figures are still provisional. It is clear, nonetheless, 
that the internal migration brought about by the war and defense pro- 

grams is of a very different character from that of movements in the 
preceding years of peace. The special character and location of war jobs, 
the withdrawal of men from the civilian labor force into the armed 
forces, and the rapid tempo of economic change have produced a very 


12 Myers, op. cit., p. 75. 








see + ee opener 

















- INTERNAL MIGRATION AND THE WAR 29 


unusual situation. This situation is reflected in the relatively limited 
number of areas that have attracted large numbers of in-migrants and 
in the personal characteristics of the in-migrants. 

Other data also confirm the conclusion that internal migration in the 
defense and early war periods can not be characterized simply as arural- 
to-urban movement. Those metropolitan counties and a few scattered 
rural counties that have been active in war production because of the 
contracts secured have been practically the only areas to gain civilian 
population hy net migration. In some cases where a large army camp 
or ordnance plant had to be built in jig time in the open country, the 
population during the period of construction may have exceeded the 
present population. Population growth, and consequently net in- 
migration, have been phenomenal in some areas. In other areas where 
extraordinary growth has been claimed in overenthusiastic chamber of 
commerce and newspaper estimates, net in-migration has been small 
or nonexistent. 

During the period covered by the WPA migration surveys, from Oc- 
tober, 1940, to the summer or fall of 1941, it was observed that in- 
migration to defense centers was proceeding at an accelerating pace. 
Estimates of population made for Washington, D. C.," in the three six- 
month periods from July 1, 1940, to January 1, 1942, show successively 
larger rates of population increase. There is also evidence of an ac- 
celeration of migration since May 1, 1942, into certain areas, such as the 
Baltimore and Portland, Oregon, metropolitan counties. For the coun- 
try as a whole, it can not be ascertained whether the peak of war- 
induced internal migration has yet been reached or, if so, when. 

In addition to the usual selection of young adults in the wartime 
migration, there seem to have been, at least in the first year or two of 
the postcensal period, selections on the basis of color and sex. White 
males seemed to have formed a larger proportion of the migrant workers 
to defense centers than would be expected on the basis of their propor- 
tion in the labor force. It is possible, however, that their places may 
have been taken by a secondary migration of other persons. As the war 
proceeds and the manpower situation becomes even more serious, there 
may be waves of migration of Negroes, women, and even elderly per- 
sons. If migrant husbands are eventually able to find living accommoda- 
tions for their families, the sex ratio among migrants will be reduced 
by this cause alone. 


13 U.S. Bureau of the Census. Sizteenth Decennial Census of the United States. “Estimated Popu- 
lation of Washington, D. C., by Six-Month Periods: July 1, 1940, to January 1, 1942,” Series P-3, No. 27, 
May 25, 1942. 
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According to Myers, the defense migration has been remarkably 
successful. Unemployment among migrant workers in defense centers 
is very low, and it is probable that most of them have improved their 
economic position. The wartime internal migration, however, has 
created many problems, both for the migrants and for the communities 
that they have entered. There are obviously many new problems con- 
nected with local government, health, housing, traffic, and education. 
The process of migration and conditions in boom towns have had im- 
portant repercussions upon the family, juvenile delinquency, and re- 
creation. People who are interested in the decentralization of industry 
as a means of securing more wholesome living conditions and a more 
nearly uniform distribution of economic well-being are now confronted 
by greater concentration of industry and population than existed in 
1940. Even before the war, students like Creamer" have shown that de- 
centralization of industry was not going on. 

Hauser, in an address on “Population Shifts and Income Changes” 
(as yet unpublished), read before the American Management Associa- 
tion on January 15, used the quartile in which the rate of growth of 
a metropolitan county or group of counties fell during each period to 
evaluate the permanency of the wartime population shifts. (The periods 
were the same as those used in this article; namely, 1940 to 1942, 1930 
to 1940, and 1920 to 1930.) On the basis of the simple criterion of pre- 
viously observed growth during two periods characterized by radically 
different economic conditions, he concluded that on “a regional basis 
it is clear that the metropolitan centers which have grown most rapidly 
and which . . . have the best prospect of retaining their growth are in 
the South and West.” 

Whether the enormous wartime gains in population made by particu- 
lar areas are retained after the war would seem to depend in large 
measure upon the resiliency of the area’s economy. If the industries in 
these areas can not shift readily to peacetime production, there may 
well be wholesale repatriation of migrants, leaving ghost towns in their 
wake, or else depressed areas supporting heavy relief loads. This last 
would be further complicated by residence requirements for relief and 
unemployment compensation. The major determinant in the post-war 
period, however, may be drastically new government policies and pro- 
grams that will far outweigh the effects of the factors just mentioned. 


4 Creamer, Daniel B., Is Industry Decentralizing? (Study of Population Redistribution), Philadel- 
phia, University of Pennsylvania Press, 1935. 























POPULATION AND WAR LABOR SUPPLY* 


By Wi.ii1am H. Mavrtz anp Joun D. Duranp 
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is primarily a function of manpower, materials, and management. 
During our first year in the war, we have made great progress in solving 
problems of materials and management; manpower is now the Number 
1 problem. The length of the war will depend to a large extent upon 
our success in attaining the most effective utilization of this nation’s 
human resources. 

Labor supply problems are involved in nearly every phase of the war 
effort. They include allocation of manpower between the armed forces 
and civilian production, transfer of workers from nonessential indus- 
tries, reduction of unemployment and absenteeism, lengthening work- 
ing hours, more efficient utilization of labor resources by means of 
training and upgrading, and increasing the total labor supply mainly 
by recruiting additional workers from the population normally not in 
the labor force. This paper deals with the latter part of the problem; 
the mobilization of the nation’s potential labor supply. 

The War Manpower Commission’s estimates of labor requirements 
call for a total labor force of 62,500,000 by December, 1943, including 
armed forces of 10.7 millions.! This means an increase during the cur- 
rent year of 3,500,000 over the estimated 59,000,000 in the labor force 
in December, 1942, and a total increase of 9,500,000 since March, 1940. 
From what sources in the population have the 6,000,000 workers come 
who were added to the labor force between March, 1940, and December, 
1942? Can these sources supply another 3,500,000 workers in 1943? 
Upon what groups of the population and in what areas should the 
efforts to recruit additional workers be focussed? 

These are primarily demographic questions, since the size, geographi- 
cal distribution, and characteristics of the population are basic determi- 
nants of the nation’s potential manpower and of the measures that are 
appropriate to realize ‘it. Fortunately, the most comprehensive inven- 
tory of this nation’s human resources that has ever been available is 
assembled in the tabulations of the 1940 population census, including 
a wealth of information about the characteristics and geographical 
distribution of the labor force and of persons not in the labor force. In 


em EFFECTIVENESS With which a modern nation can wage total war 


* This paper was prepared for presentation at the 104th Annual Meeting of the American Statistical 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 
1 War Manpower Commission, estimates prepared in January, 1943. 
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addition, the monthly sample survey of the labor market (transferred 
to the Census Bureau from the Work Projects Administration in 
August, 1942) provides current data on the size and composition of the 
labor force. 





SOURCES OF ADDITIONAL WORKERS 


Since the first determinant of national labor supply is the size of the 
population, population growth is an important source of additional 
manpower. Growth of the population of working age provided almost 
1,500,000 of the 6,000,000 increase in the labor force between March, 
1940, and December, 1942, and will provide a further increase of 500,000 
by December, 1943. The remaining 3,000,000 additional workers 
needed by the end of 1943 will have to be obtained by still more ex- 
tensive employment of persons normally not in the labor force. 

Employment of men.—There is very little room for expansion of the 
male labor force except among boys of school age. The 1940 census 
showed only 10,600,000 males 14 years old and over not in the labor 
force, of whom 3,700,000 were reported as unable to work or confined 
to institutions. The age classification of men not in the labor force in 
March, 1940, is shown in Table I. 














TABLE I 
MALES NOT IN LABOR FORCE: MARCH, 1940* 
(millions) 
J y c oO i 
Age Total t nable we work In school thers not in 
or in institutions labor force 
14 years and over 10.6 3.7 4.6 2.3 
14 to 17 years 4.0 0.1 3.5 0.4 
18 to 44 years 2.5 0.8 0.7 
45 to 64 years 1.5 1.0 -— 0.5 
65 years and over 2.6 1.8 — 0.7 





* U. 8S. Bureau of the Census, “Labor Reserves in the United States by Age, Marital Status, and 
Sex” (Release Series P-14, No. 4), October 9, 1942. More detailed data will appear in the forthcoming 
Census Bulletin, Characteristics of Persons not in the Labor Force. 


The 1,800,000 able-bodied men 18 to 44 years old who were not in the 
labor force at the time of the census were mostly older students, many 
of whom have been inducted into the armed forces since that time, 
or have left school early to take war jobs. Probably not very many more 
workers can be obtained from this group. 

Most of the 1,200,000 men over 45 who were not in the labor force 
but able to work in 1940 have doubtless become workers since that time, 
and even the group of 2,800,000 shown in the census as unable to work 
or in institutions has apparently made a sizeable contribution to the 
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labor supply. About 1,300,000 men over 45 were added to the labor 
force between March, 1940, and October, 1942. About 500,000 of these? 
were men over 65. These increases have resulted partly from popula- 
tion growth, partly from the postponement of retirements, and partly 
from the return to work of men who had previously withdrawn from the 
labor force. 

There is still opportunity for some further expansion of the labor 
force in this direction. The monthly survey of the labor market for 
November, 1942, which included the question, “Could this person take 
a full-time job for wages if one were available within 30 days?” in- 
dicated that about 300,000 men over 45 years old would be available 
for work. About 200,000 of these were over 65 years of age.* Neverthe- 
less, the increase in the labor force in the male population over 45 is 
nearing the limit imposed by disability, chronic illness, and old age. This 
part of the population cannot supply a large share of the additional 
labor force needed during the war. 

Employment of children.—Boys and girls of ’teen age have provided 
a considerable part of the additional labor force up to the present. The 
usual influx of students into the labor market during summer vacations 
has been abnormally large in the last two years, and has constituted an 
important source of labor supply, especially in farming areas. In addi- 
tion, year-round increases have been obtained from students dropping 
out of school to take war jobs, or doing part-time work after school 
hours. In March, 1940, the number of persons 14 to 19 years old in the 
labor force was 3,900,000 ;* the corresponding figure for March, 1942, 
was 4,800,000.° In the summer of 1942, seasonal employment of young 
people on a scale unprecedented in recent years raised the labor force 
in this age group to a peak of 8,400,000 in July. By October, 1942, the 
figure had declined seasonally to 5,600,000, but was still 1,700,000 
higher than that for March, 1940. Much of this increase undoubtedly 


2? U.S. Bureau of the Census, “Age of the Labor Force, by Sex, for Regions, Divisions, and States, 
and for Cities of 100,000 or More: 1940” (Release Series P-16, No. 1), August 20, 1942; and Monthly 
Report on the Labor Force, “Age and Sex of Persons in the Labor Force: June 1941-—October 1942” 
(MRLF-No. 4), December 12, 1942. Although the results of the Work Projects Administration's “Sam- 
ple Monthly Report of Unemployment” for the week of March 24 to 30, 1940, were not in complete 
agreement with the employment status data obtained in the decennial census enumeration for the 
same week, it is believed that the differences do not invalidate any of the conclusions of this paper. 
Studies are in progress which will provide the basis for more precise comparisons between 1940 census 
data and the results of the monthly sample surveys. 

*U. 8S. Bureau of the Census, Monthly Report on the Labor Force, “The Nation's Available Labor 
Reserve; November, 1942” (MRLF—No. 7). This report gives the distribution by sex, age, farm resi- 
dence, previous work experience, and other characteristics of persons not in the labor force who indicated 
that they were available for full-time employment. 

‘U.S. Bureau of the Census, Release Series P-16, No. 1. 

5 U.S. Bureau of the Census, MRLF—No. 4. 
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came from persons past high-school age, but a considerable part of it 
must have resulted from increased employment* of children under 18. 

The relatively high wages now being offered to inexperienced young 
workers will probably continue to attract young people from the 
schools to the war factories.” Full-time employment of children under 
18 is restricted by child-labor and compulsory school-attendance laws, 
and by established standards of education, but these restrictions have 
been relaxed somewhat since this country entered the war. Some States 
have modified their child-labor laws, and there has been a tendency to 
enforce these laws less strictly in view of the manpower shortage. Not 
very many more children under 18, however, can be brought into the 
labor market as full-time workers unless a wholesale sacrifice of educa- 
tional standards is decided upon for the sake of additional labor supply 
during the war emergency. Otherwise, the principal contribution of 
children under 18 to the war labor supply is likely to be in the form of 
part-time and seasonal work. 

Employment of women.—Further large additions to the labor force 
will evidently have to come mainly from the female population. Women 
who enter the labor force normally remain for only a relatively short 
period after leaving school, and before getting married and starting to 
raise a family.® 

During the last two years, women have been entering the labor 
market earlier and in larger numbers, and remaining longer than in 
normal times. Some women who had retired from the labor force to take 
up homemaking have returned to work, and a considerable number of 
housewives who were not previously workers have been attracted to 
the labor market by higher wages, better jobs, and patriotic appeals. 
Inductions of married men in the armed forces have forced some wives 
to take jobs to support themselves and their families. The absorption 
of women into the war labor force has been facilitated by establishing 
training schools for war production jobs, by simplifying machinery and 
breaking down complex industrial operations, and more recently by 
encouraging part-time work. Nurseries for the children of working 

6 In the case of males, the majority of the increase between March, 1940, and October, 1942, for the 
age group 14 to 19 must have come from boys under 18, since the increase for the ages 14 to 19 (1,000,000) 
exceeded the number of males 18 and 19 years old not in the labor force in March, 1940 (850,000). The 
authors estimate that perhaps 800,000 of the total increase of 1,700,000 for males and females 14 to 19 
years old occurred in the ages 14 to 17. 

7In November, 1942, there were 500,000 girls and 200,000 boys 14 to 19 years old not in labor 
force who were available for full-time jobs. U. S. Bureau of the Census, MRLF—No. 7. 

8 For data on labor-force status of women by age, marital status, and household relationships, see: 
U. 8. Bureau of the Census, “Household Relationships of the Labor Force” (Release Series P-14). More 


detailed statistics will appear in the forthcoming Census bulletin, The Labor Force—Employment and 
Personal Characteristics. 
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mothers have been organized in many localities to free women for war 
work. The result of these developments has been an increase of 2,700,000 
in the female® labor force between March, 1940, and December, 1942. 
Most of the increase has come from girls in their ’teens and early 20’s, 
and women between the ages of 35 and 55. 

The authors estimate that the reservoir of women 18 years old and 
over not in the labor force stood at 33 million at the end of 1942. From 
this reserve must be drawn most of the 3,000,000 additional workers 
needed by the end of 1943. In November, 1942, 4,500,000 women not 
in the labor force were available for full-time employment.' 


TABLE II 


FEMALE POPULATION AND PER CENT IN THE LABOR 
FORCE, BY FAMILY STATUS; MARCH, 1940* 








— 














Family status 18 to 44 18 to 24 25 to 34 35 to 44 
years old years old years old years old 
Female population (millions) 
6 cin akebicdddheensgkeinsieeniwas 28.2 8.4 10.8 9.1 
Married, husband present.................. 18.2 3.3 7.9 7.0 
Without children under 10................ 8.3 1.5 2.8 4.0 
With 1 or more children under 10......... 9.9 1.9 5.1 2.9 
I I 6 5. 06k 6s0:0eceeecsansecs 10.0 5.0 2.9 2.1 
EE ere ee eee ne ee v.00 4.7 2.0 0.9 
Widowed, divorced, or married, husban 
RETR EES LARS ee Ne 2.4 0.3 0.9 1.2 
Per cent in labor force 
nnd 6a thd a cheteseebneekennees 33.9 43.2 32.7 26.7 
Married, husband present.................. 16.0 15.6 17.4 14.6 
Without children under 10................ 25.7 27.3 33.4 19.7 
With 1 or more children under 10........... 8.0 6.4 8.8 7.6 
can is ctuncsdsnsaceanne 66.3 61.5 74.5 66.5 
a cca ae cama eee hie ania katoi 67.7 62.0 78.5 73.3 
Widowed, divorced, or married, husband 
Sia cone dhiek ew adeeb means 61.4 53.9 64.9 61.1 





* U.S. Bureau of the Census, Release Series P-18, No. 13. 


Employment of women under 45.—The chief obstacle to the expansion 
of the labor force among women between the ages of 18 and 44 is the 
responsibility of caring for their homes and families, and especially for 
young children. The importance of these factors is shown by Table II 
on the labor-force status of women 18 to 44 years old according to 
family relationships, based on 1940 census data. 


* U. S. Bureau of the Census, Monthly Report on the Labor Force, “Estimates of Labor Force, Em- 
ployment, and Unemployment in the United States During the Week of December 6-12, 1942” (MRLF— 
No. 6), January 9, 1943. 

10 U.S. Bureau of the Census, MRLF—No. 7, special report on available labor reserves, November 
1942, 
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Even in 1940, two-thirds of the unmarried women in this age group, 
and one-fourth of the married women without children were in the 
labor ferce. Only 8 per cent of the mothers of young children were 
workers, and these few were probably mostly women who could hire 
servants or arrange for someone to take care of the children during 
working hours, or mothers who were forced to work by pressure of 
poverty. 

The distribution by family status of women 18 to 44 years old who 
were not in the labor force in 1940 shows that there was not very much 
room for expansion of the labor force in this age class without the em- 
ployment of women with considerable home responsibilities. Of these 
women, 82 per cent were married with husband present in the house- 
hold, and 60 per cent of those who were married had one or more chil- 
dren under 10 years of age. Only 18 per cent were single, widowed, 
divorced, or married, husband absent." 

Up to the fall of 1942, nearly 1,400,000 women 18 to 44 years old had 
been added to the 1940 labor force; about half of these were in the age 
group 18 to 24, and half in the group 35 to 44 years old.” Since most of 
these additions were probably drawn from the unmarried women and 
from the married women without children, the margin for further in- 
crease in the labor force through employment of women with rela- 
tively light home-making responsibilities is probably considerably 
narrower now than in 1940.'* Moreover, substantial increases in birth 
and marriage rates since 1940 have still further reduced the reserve of 
unmarried women without children." 

In order to provide additional information regarding the relation 
between employment of women and their family responsibilities, the 
Bureau of the Census has made extensive tabulations of 1940 census 
data on labor-force status of women according to number of children 
and various social and economic characteristics. In these tabulations, 
women in and outside of the labor force are classified by age, marital 
status, number of children under 5 and 5 to 9 years old, color, value of 
home, income of husband, education, and other characteristics." 


11 U. 8. Bureau of the Census, Release Series P-18, No. 13. 

12 The October, 1942 Monthly Report on the Labor Force (MRLF—No. 4) showed an increase of 
1,600,000 since March, 1940 for the female labor force in the age group 14 to 44 years. The authors 
estimate that about 200,000 of this increase occurred in the ages 14 to 17. 

18 Forty per cent of the women who reported that they were available for employment in Novem- 
ber, 1942, stated that they were responsible for the care of children under 14 years of age. U. S. Bureau 
of the Census, MRLF—No. 7. 

44 Hauser, Philip M., “Population and Vital Phenomena,” American Journal of Sociology, November, 
1942. 

48 For results of these tabulations, see: U. S. Bureau of the Census, Release Series P-18, Nos. 1-13. 
Additional data will appear in the forthcoming Census Bulletins, The Labor Force—Sample Statistics, 
Employment and Family Characteristics of Women, and Families—Employment Status. 
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Employment of older women.—Among women over 45, family re- 
sponsibilities are only a secondary factor limiting the expansion of the 
labor force, since comparatively few women over that age have any 
young children. Many of these women, of course, are so heavily bur- 
dened with keeping house for older children or other dependents, or 
for a large family of adults, that they have no time for other employ- 
ment. These limitations, however, are probably less important than the 
handicaps of age and lack of recent work experience, which make it 
more difficult for women over 45 to get jobs than for younger women. 

In spite of these handicaps, the number of women over 45 in the 
labor force has increased considerably during the last two years. Be- 
tween March, 1940, and October, 1942, the female labor force increased 
by one-sixth in the age group 45 to 54 years; and by one-fifth in the 
group 55 to 64 years.'* There has undoubtedly been a tendency for 
women reaching the ages at which they would normally retire, to post- 
pone their withdrawal from the labor market; but this tendency alone 
is not sufficient to account for the large increases which have occurred. 
Employers have apparently had to hire more older women because 
younger workers were not available. 

Women over 45 will probably continue throughout the war period 
to be asource of considerable numbers of workers. The 4,500,000 women 
not in the labor force but immediately available for work in November, 
1942, included 1,300,000 women 45 to 64 years old and 100,000 over 
65.'7 Unless the problem of relieving younger women of their family 
responsibilities is successfully solved, these older women will have to 
provide an increasing share of the needed labor supply. Training for 
older women who have not been employed for many years is especially 
important for the efficient utilization of this part of the potential labor 
supply. 

The principal sources of additional labor supply in the population, 
and the rates at which these reserves are being absorbed into the labor 
force are summarized in Table III. 

The increase in the employment of men over 45 has probably about 
exhausted the possibilities of labor-force expansion in this part of the 
population. In October, 1942, the proportion of workers had reached 
92 per cent among men 45 to 64 years old, and nearly 50 per cent among 
those 65 years old and over. These proportions are probe. _ approach- 
ing the limits imposed by old age and disability. The proportion of 
children 14 to 17 years old in the labor force increased from about one- 
eighth in March, 1940, to about one-fifth in October, 1942. Further in- 


% U. S. Bureau of the Census, MRLF—No. 4. 
17 U. S. Bureau of the Census, MRLF—No. 7. 
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creases in the employment of children will probably have to be con- 
fined largely to seasonal and part-time work. Increases in the employ- 
ment of women have been much smaller in proportion to the size of this 
part of the potential labor supply. Women 18 to 64 years old, of whom 











TABLE III 
Labor-force increase Persons not in labor force, 
Population group March, 1940, to October, 1942 October, 1942* 
(millions) (millions) 
Children 14 to 17 years old 0.8 7.4 
Men 45 years old and over 1.3 3.6 
Women 18 to 44 years old 1.4 18.1 
Women 45 to 64 years old 0.5 10.4 
Women 65 years old and over ~- 4.6 





* Estimated on the basis of data from the Monthly Report on the Labor Force for October, 1942 
(MRLF-No. 4) and estimates of population by age and sex as of that date. 


there were over 28 million not in the labor force in October, 1942, are 
the population group upon which attention must be concentrated in the 
effort to add 3,000,000 more workers to the labor force by the end of 
1943. 


GEOGRAPHICAL DISTRIBUTION OF THE POTENTIAL LABOR SUPPLY 


When expressed in national totals, the manpower problem is over- 
simplified. The concrete problem is to provide the labor supply needed 
in the aircraft factories and shipyards of Baltimore, on the farms of 
Aroostook County, Maine, and in each of hundreds of other local labor 
markets. Comparisons of national labor requirements with sources of 
additional workers in the national population are misleading unless 
allowance is made for the heavy concentration of manpower needs in 
certain areas. 

Lack of balance between the geographical distribution of the popula- 
tion and the distribution of war labor requirements enormously com- 
plicates the problem of efficiently utilizing the nation’s human resources. 
In areas where comparatively little essential war production is carried 
on, labor surpluses tend to develop when nonessential production is 
curtailed, and it is not easy to transfer these surpluses to areas of labor 
shortage. In the New York City area, for example, where there has 
been relatively little expansion of war production, labor reserves are 
not easily mobilized for employment in critical labor-shortage areas in 
upstate New York or even in nearby New Jersey. 

Internal migration during the last two years has played a very im- 
portant part in bringing the supplies of labor more nearly into balance 
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with the demands.'* The possibilities of migration are limited, however, 
by lack of housing, transportation, and other facilities, in the crowded 
cities where war industry is centered. In these areas, future increases 
in the labor force will have to be obtained to a greater extent from the 
population already present. Reserve labor supplies among women and 
young persons will have to be employed much more extensively in such 
areas than in the nation as a whole. 

The provision of an adequate farm labor supply is one of the most 
difficult problems in the geographical distribution of manpower. Oc- 
cupational deferments for agricultural workers will do much to halt 
the depletion of farm labor supplies through inductions into the armed 
forces, but the drain resulting from migration of workers from farms 
to industrial centers is likely to continue. Farm youths are attracted 
to the cities not only by prospects of higher wages but also by the more 
varied amusements and less isolated existence which urban life offers. 
Meanwhile, heavy demands are being made upon agriculture to provide 
food for the armed forces and the lend-lease program. The problem of 
agricultural labor supply is complicated by seasonal variations in farm 
labor requirements, by difficulties resulting from the remote situation 
of farms, and by special obstacles to the employment of women in the 
farm population. 

Seasonal peaks in farm labor needs are normally met partly by 
extensive employment of the women and children in the farm popula- 
tion, and partly by means of hired labor, which is supplied in some 
areas largely by migratory workers. During the last two years the 
normal sources of both hired labor and family workers have been con- 
siderably reduced. The deficiencies have been met partly by still more 
extensive employment of women and children at seasonal peaks, and 
in some cases partly by organized recruiting of school children and 
women in nearby cities and towns to act as temporary workers for such 
operations as fruit harvesting. Even with the use of such devices it will 
probably be difficult to obtain the labor supply needed for the expanded 
1943 agricultural production program. 

The problem of replenishing the full-time farm labor supply is still 
more difficult. Urban workers are likely to be reluctant to take jobs 
on farms, because of the disadvantages of isolation, the arduous char- 
acter of the work, and the comparatively low wages. Less than one- 
sixth of the persons not in the labor force who were available for jobs 
in November, 1942, were willing to take a farm job, although nearly 
one-fifth of them lived on farms.'® 


18 Shryock, Henry S., Jr., “Internal Migration and the War,” in this issue of the JournNaL. 
1” U. S. Bureau of the Census, MRLF—No. 7. 
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Some of the year-round labor requirements can be supplied by farm 
housewives and by farm children working part-time after school, but 
there are many kinds of farm work that women and children cannot 
regularly do. Moreover, farm housewives tend to be heavily burdened 
with housekeeping duties which greatly limit their time for other work. 
The average wife in a rural-farm family has more children to care for 
than the city wife, and enjoys fewer of the conveniences which lighten 
the housekeeping load, such as washing machines and vacuum cleaners. 

These obstacles are reflected in extremely low proportions of farm 
women in the labor force. Only 15 per cent of all women 18 to 44 years 
old in rural-farm areas (outside of metropolitan districts of 100,000 
or more) were in the labor force in March, 1940; the corresponding 
proportion for women in other areas was 38 per cent. For married 
women, husband present, with one or more children under 10 years old, 
the proportions in the labor force were 4 per cent in farm areas and 9 
per cent in other areas.?° 

Even during the war emergency, it will be very difficult to surmount 
these obstacles to the employment of women on farms. It seems prob- 
able that the contribution of farm housewives to the war labor supply 
will be mostly in the form of seasonal work and relatively light part- 
time chores at other times of the year. 

The 1940 Census tabulations provide a large amount of information 
relating to potential labor supplies in particular areas. The tabulations 
mentioned above for women in and outside of the labor force by number 
of children, age, and other characteristics have been made for each 
metropolitan district of 100,000 inhabitants or more, and for urban, 
rural-nonfarm, and rural-farm areas of State groups.”* In addition, 
statistics for persons not in the labor force, by sex, subdivided into 
those engaged in own home housework, in school, unable to work, in 
institutions, and others not in the labor force, have been tabulated for 
rural-farm and rural-nonfarm parts of counties and for each urban 
place.” 

The 1940 data for particular labor-market areas can be applied to the 
analysis of present problems of labor supply if allowances are made for 
changes since the census date due to increases in employment, popula- 
tion growth, migration, and changes in marriage and birth rates. In 
most cases migration has probably had relatively little effect upon the 


2% U.S. Bureau of the Census, Release Series P-18, No. 13. 

*1 References to publications of these data are given above. 

2 U.S. Bureau of the Census, Sixteenth Decennial Census of the United States: 1940, Population, 
Vol. II, “Characteristics of the Population.” 

















ee 




















-PopULATION AND War LABOR SUPPLY 41 


reserves of potential workers not in the labor force, since the available 
evidence indicates that most of the migrants into centers of war produc- 
tion were in the labor force.” 


The problems of recruiting 3,000,000 more workers during 1943 from 
the population normally not in the labor force, and of training and plac- 
ing them in war jobs, will tax the facilities and ingenuity of our man- 
power agencies. The nonworker groups least affected by family re- 
sponsibilities, old age, physical disabilities, and other factors restricting 
employment, have for the most part already been added to the labor 
force. We are now faced with the task of taking an additional quarter 
of a million workers each month out of a labor reserve from which the 
most readily employable groups have already been taken. Only a small 
margin remains for further increase in the adult male labor force. A 
considerable reserve of seasonal workers exists among children under 
18 years old, but only a small proportion of them will be available for 
full-time work the year-round, unless we are willing to make serious 
sacrifices in educational and child-labor standards. Home responsibili- 
ties, and particularly the necessity of caring for their children, restrict 
the employment of young women. Up to the present, additions to the 
female labor supply in the age range 18 to 44 years have probably come 
largely from unmarried women and married women with few home- 
making duties—nonworker groups that were already comparatively 
small in 1940 and have dwindled steadily since that time. An important 
share of the additional war jobs has been filled by women between 45 
and 65, comparatively few of whom have young children to care for, 
but such women are handicapped by age and by lack of recent experi- 
ence, and many of them have heavy responsibilities for the care of older 
children or of large families of adults. 

Married women 18 to 64 years old, without children under 10, are 
the most important population group still not in the labor force. This 
pool of reserve labor supply will probably have to be drawn upon more 
heavily in 1943, in spite of their family responsibilities. Inductions of 
married men without children are likely to bring a considerable number 
of wives into the labor force, especially from the group who have 
hitherto refrained from taking jobs because of the possible effect upon 
their husbands’ draft classifications. Other factors which may lead to 
an increase in employment of married women are heavier tax burdens, 

* Myers, Howard B., “Defense Migration,” in: The Manpower of the Nation in War Production, 


Book One (Hearings before the Select Committee Investigating National Defense Migration, House of 
Representatives, 77th Congress, 2nd Session, H. Res. 113, Part 27), pp. 10318-10330, esp. p. 10325. 
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rising cost of living, higher wages, and more attractive employment 
opportunities. It is not likely, however, that such factors alone will 
bring about the needed increase in the labor supply. 

The employment of women with minor children, particularly children 
under 10 years old, should certainly not be encouraged if an adequate 
labor supply can be obtained from other sources. In the crowded centers 
of war industry, however, it is becoming more and more important to 
mobilize all possible labor reserves in the resident population, including 
some mothers of young children. If these women are to be employed, 
fairly extensive provisions for child care are likely to be needed in 
critical labor supply areas. To the extent that such measures succeed 
in cutting down further migration into these areas, they will greatly 
simplify problems of war housing, transportation, and post-war adjust- 
ment. 

A national view of labor reserves in relation to manpower require- 
ments gives only the general dimensions of the problem. National 
figures showing from what population groups additional workers can 
most easily be drawn, and what factors restrict their employment, are 
useful only as guides for establishing broad national policies. The crux 
of the manpower problem is in the local community. It is there that the 
problems of labor requirements, recruitment, training, and placement 
originate and must ultimately be solved. The success of national man- 
power policy will depend chiefly upon effective direction and coordina- 
tion of the activities of national manpower agencies, regional and local 
War Manpower Committees, and joint labor-management committees 
in the hundreds of local labor markets throughout the country. Several 
months ago the President aptly summarized the problem with the 
statement, “It is not that we do not have enough people in this country 
to do the job. The problem is to have the right numbers of the right 
people in the right places at the right time.” 

















STATISTICAL LAWS OF FAMILY EXPENDITURE* 


By Hotsroox WorkKING 
Food Research Institute, Stanford University 


HE FirsT result of a statistical study is a body of data exhibiting 
hy cutie facts regarding either a particular population or a particular 
sample from some population. Analysis of the statistics in conjunction 
with other pertinent information usually permits developing the origi- 
nal conclusions into broader statements of tendencies. If the indicated 
tendencies are sufficiently general, they may deserve characterization 
as scientific laws. 

Statistical data on family expenditures have been collected in a 
number of countries at intervals over a long period of years. The most 
recent large surveys of family expenditures, and in many respects the 
most significant among all such surveys, are two which were made in 
the United States in the middle 1930’s. The smaller of the two was 
conducted by the Bureau of Labor Statistics and was confined to fami- 
lies of wage earners and clerical workers. In that survey, data were 
obtained from some 16,000 families in 42 medium-sized and large cities 
and 13 small cities. The results have been published in a series of 8 
volumes of the Bureau of Labor Statistics. The larger of the two surveys 
was a joint enterprise of the Works Progress Administration, the 
Bureau of Labor Statistics, and the Bureau of Home Economics, with 
the cooperation of the National Resources Planning Board and the 
Central Statistical Board, and cov2red more than 300,000 families of 
nearly all occupations and degrees of social status in all the major 
types of community in the United States. The results have been pub- 
lished in some 47 volumes issued by the Bureau of Labor Statistics 
(22 volumes), the Bureau of Home Economics (22 volumes), and the 
National Resources Planning Board (3 volumes). 

The present paper gives some of the results of an attempt to draw 
from this great body of data some significant generalizations which may 
deserve to be regarded as laws of family expenditure. 

The distribution of family expenditure is conditioned by inherited 
and acquired wants of the members of the family, by social customs and 
standards, and by the array of goods and services from which the family 
may choose—to mention only the more important of the conditioning 
influences. Laws of expenditure applicable in one culture and in one 
epoch may not be applicable in another culture or in another epoch. 


* This paper was prepared for presentation at the 104th Annual Meeting of the American Statistical 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 
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Possibly certain broad laws of expenditure can be established as ap- 
plicable in widely different cultures and epochs, but the present inquiry 
does not aspire to such very broad generalizations. Its objective is to 
ascertain what uniformities can be discovered in tendencies of expendi- 
ture among families of different sizes, different incomes, and different 
occupational classes in different regions! and in different types of com- 
munity in the United States in the middle 1930’s. The categories of 
families studied had widely different degrees of education and cultural 
training. They moved in groups of quite different social standards, and 
they lived under substantially different climatic conditions. On the 
other hand, they all belonged, in a broad sense, to a common culture; 
and they all had available to them substantially the same goods and 
services at about the same relative prices. 

In the final paragraphs of the paper we shall give attention to some 
evidence on temporal changes in the distribution of family expenditures 
in the United States. The whole body of evidence suggests a rather re- 
markable degree of stability in the distribution pattern of family ex- 
penditures. 

Most persons who have provided data for a questionnaire on family 
expenditures or who have given thought to the difficulties they would 
encounter in providing such data entertain some degree of skepticism 
regarding the accuracy of statistics of expenditures obtained by the 
survey method. When working with averages from a great number of 
reports, the investigator need not be concerned over the accuracy 
of individual reports so long as there is good reason to believe that 
errors in one direction tend to be about as great, on the average, as 
errors in the opposite direction. If there exist biassed errors, however, 
arising from tendencies of reporting families to overstate or to under- 
state their expenditures in certain categories, the effect can be disturb- 
ing. Among the sources of error that might exist in the reports, those 
that would be most serious from the standpoint of the present study 
would be tendencies to bias that differed among different categories of 
the population. These would tend to result in evidence that expenditure 
tendencies differed where in fact they were similar. It would be only by 
an improbable chance that differing bias in reports made by different 
groups would result in indicating a common expenditure tendency 
where the tendencies were in fact substantially different. 

The data are found in fact to exhibit a high degree of uniformity in 


1 The regional aspect does not enter the present paper explicitly. It is considered in the larger study 
on which this paper is based and the resulting findings afford part of the basis for the conclusions here 
stated. 
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tendencies in the distribution of family expenditure. This finding bears 
strong testimony to the essential reliability of the data. It does not 
exclude the possibility, however, that there may exist some bias that 
appears in about equal degree in the reports by all classes of families. 
For example, it might be that expenditures in the miscellaneous cate- 
gory tend to be uniformly understated, owing to failure to remember all 
the small items that should be included. Any concern over possible 
errors of this sort may well be laid aside until tentative conclusions 
have been formulated from the data. Few of the conclusions here drawn 
would be affected to an important degree by any probable bias in the 
reports. 


I—GENERAL LAWS 


The most general observations that may be made regarding the dis- 
tribution of family expenditures concern the relation between size of 
the total expenditure per person and its distribution. Expressing the 
relationship in this form takes account of the principal effects of three 
more or less independent influences bearing on the distribution of 
expenditures: (1) income; (2) size of family; and (3) propensity to save. 
The proportions of total expenditure devoted to different purposes tend 
to be about the same for families of the same total expenditure per 
person even though the families differ with respect to income, size, and 
proportion of income saved. Certain exceptions to this generalization 
will be noted below. 

As total expenditure per person increases, the proportion of expendi- 
ture devoted to food decreases rapidly. At the same time, the proportion 
of expenditure devoted to recreation and transportation increases 
greatly, the proportion devoted to clothing increases moderately, and 
the proportion devoted to a miscellaneous category designated as 
“other items” increases slightly. The response of expenditure on home 
maintenance to changing total expenditure per person differs among 
different categories of the population. 

The relationship of expenditure for food to total expenditure is re- 
markable for the uniformity and closeness with which it approximates 
the relation 


F/T =a— blog T, 


where F represents expenditure for food and T represents total expendi- 
ture. Expressed otherwise, this means that the proportion of total 
expenditure that is devoted to food tends to decrease exactly in arith- 
metic progression as total expenditure increases in geometric progres- 
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sion. With only certain exceptions at very low levels of total expendi- 
ture, this law of relationship applies closely for families of every size, 
of every occupation, and in each type of community studied. 

One other quite general observation is suggested by the data. Ex- 
penditures on recreation and transportation, in the United States in the 
middle 1930’s, tended to reach an upper limit at about one-sixth of 
total expenditure. Except for low-income farm families, for whom 
transportation is less appropriately classed with recreation, expendi- 
tures on recreation and transportation averaged only about 4 per cent 
of the total at the lowest levels of total expenditure per person. As 
total expenditure increased in geometric progression, the proportion 
devoted to recreation and transportation tended, for all classes of 
families studied, to increase roughly in arithmetic progression until it 
reached about 16 per cent of the total, and thereafter to increase but 
little more. 


II—THE CATEGORIES OF EXPENDITURE 


The last of the apparent general laws of expenditure stated above 
required expression in quantitative terms which are dependent to some 
extent on the precise definition given the category of consumption 
under discussion. In subsequent consideration of the subject, quantita- 
tive expressions will play a large part. The expenditure shown for any 
category of consumption depends on the definition of the category 
which is to some extent arbitrary. We may well pause at this point to 
examine the definitions of the several categories of consumption here 
used. 

For the purposes of the present paper, all expenditures for consump- 
tion are distributed among five categories: (1) food; (2) home mainte- 
nance; (3) clothing; (4) recreation and transportation; and (5) “other 
items.” The proportion of total expenditure in each of these categories 
varies greatly with income per person in the family, and varies more 
or less with other circumstances; but it is useful to observe that, for 
the “average” family, each of the first two categories—food and home 
maintenance—comprised about one-third of its total expenditure, and 
the remaining three categories accounted for about equal parts of the 
remaining one-third of the total. 

Regarding the categories of food and clothing, it need be said only 
that the value of home-grown food consumed is included, that “food” 
includes also all beverages, and that clothing expenditures include 
expenditures for cleaning of clothes, but not for laundry. Under “home 
maintenance” I count expenditures for housing and for furniture and 
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equipment, and also expenditures for household operation, including 
payments for household help. This is a category covering the provision 
of satisfactions much broader than are suggested by the term “shelter” 
sometimes applied to it.? It covers provision for the preparation and 
serving of food in the home and for the laundry of clothing, and it 
covers much of the cost of providing those amenities of life that are es- 
sentially recreation enjoyed at home. It might be desirable to have a 
category of expenditures for shelter only, if that could be done; but the 
narrowest available category—that for “housing”—covers a great deal 
beyond expenditures for mere shelter, and covers more under some 
circumstances of occupancy of the family dwelling than under other 
circumstances. Under these conditions I have chosen to broaden the 
category to comprehend substantially all those expenditures that con- 
tribute toward affording the various satisfactions associated with 
housing. 

Recreational expenditures are defined in the data here used to in- 
clude among other things: expenditures for admissions of family and 
guests to movies, lectures, ball games or dances; expenditures for 
participation in sports and games (but not including sports clothing) ; 
expenditures on radio, music, photography and pets; dues to social and 
recreational clubs.* With these I include the much larger expenditures 
on automobile and other transportation. Certain minimum transporta- 
tion expenses, varying in amount from family to family, are quite 
unrelated to recreation, but expenditures for transportation above 
such minima may be regarded as essentially recreational expenditures. 

In the miscellaneous category, “other items,” the largest single com- 
ponent, accounting for about half of the total at each expenditure level, 
is medical care, including drugs. Other components are tobacco, toilet 
supplies, cosmetics, and personal services at barber shops and beauty 
parlors; education and special lessons in sports, music and dancing; 
books, newspapers and magazines; and some expenses not suggested by 
this list. Gifts and taxes are not here counted as expenditures for con- 
sumption. 


III—CONDITIONS OF LIFE AND SOCIAL STANDARDS IN RELATION TO 
EXPENDITURE PATTERNS 


Certain of the conditions under which a family lives and works in- 


? As in the publications of the National Resources Planning Board, for example, I use the term 
“home maintenance” following the practice of the Bureau of Labor Statistics. In the case of families 
owning their homes, estimated rental value is included. 

*See National Resources Planning Board, Family Expenditures in the United States: Statistical 
Tables and Appendizes, 1941, p. 143. 
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fluence its distribution of expenditures, and so also do the standards of 
the social group to which it belongs. The effects of these two types of 
influence are not in general clearly separable on a statistical basis, since 
differences in living conditions as actually observed are generally as- 
sociated with differences in social standards also, and there are lacking 
quantitative measures of living conditions or of social standards such 
as might permit use of multiple correlation analysis to ascribe to each 
its special influence. The effects of differences in living conditions and 
in social standards are readily observable in differences of expenditure 
patterns between urban, village and farm families, and in differences 
according to occupation. To some extent occupation is itself an index 
of living conditions. A special type of difference in living conditions 
is reflected in size of family. 

Differences in conditions of life and in social standards, so far as they 
are reflected in the bases of classification that I have employed, appear 
to have exerted their chief direct influence on expenditure for home 
maintenance. They were necessarily accompanied by opposite differ- 
ences in percentages of expenditure devoted to other categories of con- 
sumption. In some instances differences in expenditure for home main- 
tenance were offset largely by differences in expenditure for food, and 
in other instances mainly or entirely by differences in expenditures on 
recreation and transportation and on clothing. It appears also that 
comparisons between types of community must take into account dif- 
ferences in general cost of living that give the dollar a greater value in 
rural districts than it has in large cities. 

Effect of type of community.—As regards the proportions of family 
expenditure devoted to the three minor categories of consumption, 
extraordinarily little difference appears between urban, village, and 
farm families. The data are shown graphically in the right half of Chart 
I in terms of curves which reflect the relation of total expenditure per 
person to the per cent of the total falling into individual categories. In 
each type of community the relations accord with the general laws 
stated in Section I. 

Urban families spent less on recreation and transportation and less 
on clothing at each level of total expenditure per person than did either 
village or farm families. This rather surprising fact appears in a more 
reasonable light when it is noted that the percentages spent by urban 
families for recreation and transportation and for clothing at each level 
of total expenditure per person tended to be about the same as the per- 
centages spent for the same purposes by village and farm families with 
a total expenditure per person 20-25 per cent smaller. The general cost 
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of living is substantially lower in villages and in rural communities, as 
a rule, than in large cities, and it may well be that if comparison could 
be made between families with the same purchasing-power expenditure 
per person rather than between families with the same dollar expendi- 
ture per person, the percentages spent by urban, village and farm 
CHART I 
EXPENDITURE CURVES FOR URBAN, VILLACE, AND FARM FAMILIES* 


(Arithmetic scale for percentages; logarithmic scale for total expenditure per person.) 
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* Data computed from tables in Family Expenditures in the United States: Statistical Tables and 
Appendixes, National Resources Planning Board, 1941. 


families for recreation and transportation and for clothing would be 
found to have differed very little. 

The close similarities among urban, village and farm families as re- 
gards tendencies in allocation of expenditure to recreation and trans- 
portation, clothing and “other items” make it appear the more remark- 
able that they differ so greatly as regards expenditures for home main- 
tenance and for food. Among urban and village families spending a total! 
of about $200 per person, slightly less than 40 per cent of the total was 
spent for food. For farm families at the same expenditure level the 
food consumed was valued at over 50 per cent of the total expenditure.‘ 


‘ The imputed value of home-grown food is included in the figures for total expenditure as well as 
in the figures for expenditure for food. 
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On the other hand, the proportions of total expenditure that were de- 
voted to home maintenance by families spending about $200 per person 
were about 39 per cent for urban families, 33 per cent for village fami- 
lies, and only 21 per cent for farm families. If the data from which these 
percentages were derived represented only a few families in a few 
selected communities, these differences might deserve little attention. 
The percentages are based, however, on records for a great number of 
families in numerous communities widely scattered over the United 
States. The smooth course of the corresponding curves in the left half 
of Chart I attests the consistency of the data. The disparities among the 
curves unquestionably reflect existence of striking differences in ex- 
penditure tendencies among urban, village and farm families in the 
United States in the middle 1930’s. 

The differences between urban families and farm families as regards 
expenditures on food and on home maintenance are complementary, 
and both may arise mainly from conditions that encourage low expendi- 
ture on living accommodations on the farm. A minor factor in the 
difference is land rent. The ground on which the farm home stands is 
cheap by comparison with the price of a city lot. Indeed, land value 
may rather generally have been neglected in estimating the cost of 
home maintenance on farms. There is a tendency also for construction 
costs to be lower in the country than in the city and for old farmhouses 
to be valued much farther below replacement cost than similarly old 
dwellings in cities. There remains, however, the major fact: that farm 
families at a low economic level tend to be content with poorer living 
quarters, less well furnished, than city families at the same economic 
level. In a very real sense the farm itself is a home. On a farm, the 
family whose dwelling affords little but the minimum of shelter enjoys 
pleasures of living comparable with those to be obtained in a city only 
through substantially larger expenditure on the home itself. 

The saving which farm families tend to make on expenditure for 
home maintenance is not distributed in equal proportions among other 
categories of consumption, but tends to be reflected mainly in increased 
outlay for food. The large energy requirements of most farm work lead 
to heavy food consumption, home-produced foods tend to be used 
generously, and table waste, because it provides food for the farm ani- 
mals, is not watched as closely as with most city families in similar 
circumstances. 

As total expenditure increases, the expenditure pattern of farm fami- 
lies tends to approach that of city families. At an expenditure level of 
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$600 per person during the middle 1930’s, the distribution of expendi- 
tures by farm families differed little from that of city families. 
Expenditures of village families on food and home maintenance show 
characteristics much closer to urban families than to farm families. 
The differences between village and urban families as regards expendi- 


CHART II 


FOOD AND HOME MAINTENANCE EXPENDITURE CURVES FOR COLUMBUS, 
OHIO, FAMILIES CLASSIFIED BY OCCUPATION AND BY SIZE* 


(Arithmetic scale for percentages; logarithmic scale for total expenditure per person.) 
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reau of Labor Statistics Bull. 644, 1941.) 


tures on food and on home maintenance may arise mainly from dif- 
ferences in costs. 

The tendency for the proportion of expenditure devoted to food to 
decrease in arithmetic progression as total expenditure increases in 
geometric progression appears clearly in Chart I. With percentages 
plotted to an arithmetic scale and total expenditures to a logarithmic 
scale, all three curves of expenditure for food approximate closely to 
straight lines for total expenditures above $230 or less per person. 
Effect of occupation.—The special expenditure characteristics of farm 
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families are perhaps attributable quite as much to occupation as to 
place of residence. Among urban families, occupation had a consider- 
able influence on the distribution of expenditures, especially as regards 
expenditure on home maintenance. In the left half of Chart II are 
shown graphically the relations between total expenditure per person 
and the proportions of the total devoted to food and to home main- 
tenance by Columbus, Ohio, families in three occupational classes. The 
expenditure characteristics noted below in discussion applying specifi- 
cally to the data shown for Columbus have been found also in similar 
data for New York City, though with differences as regards the actual 
percentages at particular expenditure levels.§ 

Independent business and professional families tended to devote 
about 37 per cent of total expenditure to home maintenance at most 
expenditure levels, but there appears to have been a slight tendency 
for the proportion to decline as total expenditure increased, from nearly 
38 per cent for families spending under $700 per person to 35 per cent 
for families spending about $1,000 per person. 

Wage earners and clerical workers at the lowest levels of expenditure 
per person devoted over 38 per cent of their expenditures to home 
maintenance. The proportions declined markedly and in approximate 
arithmetic progression as total expenditure increased in geometric pro- 
gression. At the highest levels of expenditure per person recorded for 
families of wage earners and clerical workers, they devoted about 30 
per cent or less to home maintenance. At each level of total expenditure, 
the proportion devoted to home maintenance by clerical workers was 
above that for wage earners and below that for independent business 
and professional families. 

The differences between occupations as regards expenditure on home 
maintenance are not reflected in opposite differences in expenditure on 
food, the other major category of expenditure (Chart II). Instead, they 
were counterbalanced almost entirely by opposite differences in ex- 
penditures on recreation and transportation (data not shown). Much 
expenditure on home maintenance may be essentially expenditure for 
recreation. Thus the data suggest the inference that business and pro- 
fessional families tend to take more of their recreation in the home 
than do families of wage earners and clerical workers with the same 
total expenditure per person. It is to be noted, however, that among 
farmers—another occupational category—low expenditure on home 

5 The statistics have been so published that data such as are shown in Chart II can be obtained 
only by calculation for each city separately. In the published tables, the classification is by family in- 


come rather than by expenditure per person. The lines between plotted points in Chart II connect 
points corresponding to successive classes in the tabulation by family income. 
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maintenance does not go with high expenditure on recreation in the 
formal sense. 

Effect of size of family.—F amily size shows effects on the distribution 
of expenditure that arise partly from inherent “advantages of size.” 
The large family can make more efficient use of normal housing facilities 
and of the appurtenances of the home than can the small family. This 
seems the principal explanation of the large differences in expenditure 
on home maintenance shown by the curves of Chart II for families of 
different size. A tabular comparison for Columbus families having a 
total expenditure of $550—$650 per person appears as follows: 


Size and description of family Proportion for home 
maintenance 
2, husband and wife 40 per cent 
4, husband, wife, and 2 children under 16 32 per cent 
5-6, husband, wife, 1 child under 16, 1 person over 16, 
and 1 or 2 others of any age 27 per cent 


For each size of family, the data show a decline in proportion of ex- 
penditure for home maintenance as total expenditure increased up to 
a certain point, and then a tendency for the proportion spent on home 
maintenance to increase slightly with further increase in total expendi- 
ture. The combined figures for all urban families as shown in Chart I 
exhibit even more clearly this characteristic of expenditure on home 
maintenance. It results principally at least from the fact that at the 
lower and middle expenditure levels the percentages are dominated by 
data for families of wage earners and clerical workers, while at the 
upper expenditure levels the percentages are dominated by data for 
families of business and professional people. On this account it cannot 
be said that there exists a general tendency for the curves of percentage 
expenditure on home maintenance to be concave upward. Rather, 
there are different tendencies for different occupations, and for each 
occupation the expenditure curve, as previously noted, is nearly linear 
when plotted as in the accompanying charts. 

It is noteworthy also that for families spending over about $800 per 
person there was little or no difference between percentages spent on 
home maintenance by two-person and by four-person families. Although 
larger families use the facilities of the home more efficiently than two- 
person families, they have also more inclination to expand the satis- 
factions that the home affords. As between two-person and four-person 
families, these opposing influences offset each other at the higher levels 
of expenditure per person. 
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Expenditures for food by Columbus families of different sizes show 
a tendency for two-person families to spend more on food than did 
larger families with the same expenditure per person. The reason is not 
clear. Two-person families may tend to waste more food, they may tend 
to entertain more at meals, and/or they may tend to use more expensive 
food, bought in cans or at delicatessen stores. The logical suggestion 
that they may tend to spend more on purchased meals is contradicted 
by the statistics.® 

The curves of expenditure for food by families of different size show 
again the characteristic form of such curves. The proportion spent for 
food tends to decline in arithmetic progression as total expenditure 
increases in geometric progression. 

The relatively high expenditures by two-person families on both 
home maintenance and food were made possible in about equal degree 
by holding down expenditures on recreation and transportation and on 
clothing. Families of five or six persons with total expenditures under 
$400 per person took advantage of their economy on home maintenance 
chiefly to increase expenditure on clothing, but at higher levels of total 
expenditure they also divided the difference about equally between in- 
creased expenditure on clothing and increased expenditure on recrea- 
tion and transportation. 


IV—TEMPORAL CHANGES IN TENDENCIES OF FAMILY EXPENDITURE 


The foregoing observations may be used in conjunction with calcula- 
tions provided by W. Allen Wallis in a recent paper’ to throw some light 
on changes that occurred in tendencies of family expenditure in the 
United States during the first 35 years of the present century. For this 
purpose we may use most advantageously Wallis’ data on the amounts 
expended for certain categories of consumption by families at an aver- 
age income level.® 

Such great changes in value of the dollar as have occurred since the 
beginning of the century may be assumed to tend to cause horizontal 
shifts in such expenditure curves as are shown in Charts I and II 
above. Such shifts would tend to be accompanied by large changes in 
the percentage distribution of expenditures at any particular expendi- 
ture or income level. In interpreting Wallis’ data, this circumstance 


* Family Income and Expenditure in Nine Cities of the East Central Region, 1935-86: Vol. II, Family 
Expenditure (Study of Consumer Purchases: Urban Series, U. 8. Bureau of Labor Statistics Bull. 644, 
1941), pp. 133-135. 

7 ‘The Temporal Stability of Consumption Patterns,” The Review of Economic Statistics, November 
1942, XXIV, 177-183. 

8 See his figures in Column H, Table I, of the paper cited above. 
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must be taken into account. For the present, I attempt only such con- 
clusions as may be drawn without making quantitative adjustment for 
the price-level factor. 

A striking feature of the data presented by Wallis is the great in- 
crease between 1901 and 1935 in the principal components of expendi- 
ture for home maintenance. In terms of percentages of income reported, 
expenditure by urban families for rent or its equivalent and for fuel 
and light increased from 18.9 per cent in 1901 to 29.1 per cent in 1935. 
Expenditure for “household operation” increased between 1918 and 
1935 from 8.7 per cent to 12.1 per cent. These increases seem greater 
than can be explained by such horizontal shifts in the expenditure 
curves as might be expected to accompany the changes in general cost 
of living that occurred over the interval. 

Expenditures for recreation and transportation increased from 6.3 
per cent in 1918 (they are not given for earlier years) to 13.7 per cent in 
1935. This seems also a considerably larger increase than can be ac- 
counted for by a shift in expenditure curves resulting only from change 
in the cost of living. 

It appears then that, over the period for which we have data on family 
expenditures in the United States, there have been some marked 
changes in tendencies of family expenditure. If the proportions of ex- 
penditure devoted to home maintenance and to recreation and trans- 
portation have tended to increase, it is likely that the proportions 
devoted to food and to clothing have tended to decrease. Changes in 
relative costs have doubtless exercised considerable influence on the 
result. Detailed interpretation must wait on solution of some difficult 
problems of quantitative comparison of expenditure patterns at differ- 
ent times. 





APPLICATIONS OF STATISTICAL LAWS OF 
FAMILY EXPENDITURE 


Vv 


Section I above is essentially a summary of the main conclusions 
developed in the sections that followed. At this point it is appropriate 
to look at the significance of the findings from a different point of view. 
To what extent are the results of surveys of family expenditures appli- 
cable to the interpretation of current economic situations and to the 
anticipation of future developments? 

Tendencies in family expenditure do change with time and no studies 
yet made provide a reliable basis for appraising current changes or 
anticipating future changes. Until the requisite studies are made, 
changes must be allowed for on the basis of a priori judgments, unrelia- 
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ble as they may be. To such judgments the findings of the present paper 
may contribute appreciably. 

It is particularly noteworthy in this connection that the present find- 
ings support the view that the effects of changes in income through 
time may be judged fairly reliably from differences in expenditure 
patterns at different levels of expenditure at a given time. This is indi- 
cated especially by the fact that expenditure patterns differ little ac- 
cording to size of family, and that such differences as are observed are 
explicable largely on technical grounds. Expenditure per person de- 
pends to a large extent on size of family, which changes fairly rapidly 
through time for individual families. The expenditure patterns for 
families of different sizes might show differences resulting from recent 
reduction in expenditure per person incident on increase in the family 
size. The fact that differences calling for such explanation do not appear 
suggests that families allowed or compelled to change to a new level 
of expenditure per person tend to change quickly to the expenditure 
pattern characteristic of that level. 

Changes in income through time probably tend to have a pronounced 
effect on the proportion of income saved. Curves expressing the relation 
of expenditures in particular categories to total expenditure per person, 
therefore, are probably more stable in periods of changing income than 
are curves expressing the relation of expenditures in particular cate- 
gories to income per person. 




















CORRECTION FORMULAS FOR MOMENTS OF A 
GROUPED-DISTRIBUTION OF DISCRETE VARIATES 


By Joseprn A. Prerce 
Atlanta University 


certain frequency distributions of discrete variates, it is often 
desirable to consider a grouped-distribution which is obtained from the 
original distribution by an arbitrary grouping of the given data in class 
intervals of width k. The moments are then computed on the assump- 
tion that all of the items of each class are concentrated at the middle 
point of the interval. This assumption introduces grouping errors be- 
-ause the items of each interval usually do not all lie at the middle 
point of the interval. It is necessary, therefore, that corrections be used 
which compensate for the grouping errors. 
The general formula for correction for grouping errors of a distribu- 
tion of discrete variates which has been grouped in “k groupings of k” 
has been given by the writer! as 


[t/2] t 
Hi:zy, = > (25) url (1) 


s=0 


as OF THE LABOR involved in the computation of moments of 


where y.,, is the average of the t-th moments of the grouped-distribu- 
tions, u;~2; is the ({—2s)-th moment of the ungrouped data and Rg, is the 
2s-th central moment of a rectangular distribution of discrete variates. 
Usually it is desired to obtain the value of the moments of the un- 
grouped data but (1) solved explicitly for u, takes a complicated form? 
and is omitted from this paper. However, it may be readily seen that 
these correction formulas express the true moments in terms of the 
average grouped moments. Thus the corrections which are found are 
average corrections. But in actual problems we do not know the exact 
values of the averages of moments of the grouped-distributions and we 
approximate their values by utilizing the corresponding moments com- 
puted from a single, arbitrarily selected grouping. It is possible, in any 
instance, that the use of the approximation may cause a greater devia- 
tion between the “corrected” moment and its true value than would 
have resulted if no attempt were made to correct for grouping. Thus, 
while the systematic errors due to grouping are removed by the cor- 
rection formulas, the problem of eliminating the accidental errors, 


1J. A. Pierce, “A Study of a Universe of n Finite Populations with Application to Moment-Function 
Adjustments for Grouped Data,” Annals of Mathematical Statistics, Vol. 11 (1940), p. 326. 
2 Op. cit., p. 327. 
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introduced through the use of approximations for the averages of 
moments of grouped-distributions, has not been solved. Carver® 
pointed out the serious nature of this problem and recently Dwyer 
made an advance towards its solution by presenting grouping methods 
which give better “corrected” values of moments from a single group- 
ing than can be obtained with the use of the conventional correction 
formulas. 

It is the purpose of this paper to present new correction formulas for 
moments, computed from a single grouping, which yield the exact 
values of the moments of the ungrouped data. 


THE FIRST MOMENT 


Let the ungrouped distribution be the set of variates z;, ({=1, 
2, - +--+, 8), with the corresponding frequencies f;, ({=1, 2, - - -,s). The 
x’s are subject to the condition 2;,,—2;=h and consequently some of 
the f’s may be zero. The mean of this distribution is u, =) >2f/N where 
N=Syf. 

If the given distribution is grouped in a “grouping of k” and any z; 
is placed in the r-th position of a class, then the limits of the class are 
ai—(r—1)h and 2;+(k—r)h and the midvalue is z;+3[k—(2r—1) Jh. 
Therefore, when the midvalue is used as the value of z;, the quantity 
4[k—(2r—1) ]h is added to the true value of z;. For convenience, we 
define 


e, = $[k — (2r — 1)]Jh (2) 
F=f, + frok + frs2ktt cee (3) 


If v; is the midvalue of the 7-th class in the r-th grouped-distribution, 
the mean of the ungrouped data is 


and 


1 k-r 1 ¢c ik 
m=— Do (mn —eridfiat+— LD 2 (vi — edfsots (4) 
N ico N ind fm(i—-1) 41 
where c is the number of classes and ¢4,=ep, p=0,1,2,---,(r—1). 
Therefore 
1 k 
in. ee > F i€r43--1, (r - 1, 2, viet k), (5) 
N i=l 
or 
pi or, (r va 1, 2, iets k), (6) 


*H. C. Carver, “The Fundamental Nature and Proof of Sheppard's Adjustments,” Annals of 


Mathematical Statistics, Vol. 7 (1936), p. 171. 
‘4 P.S. Dwyer, “Grouping Methods,” Annals of Mathematical Statistics, Vol. 13 (1942), pp. 138-155. 
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where we define 


k 
o™ = a Do Fie"e4s-1 (7) 
i=l 
and ,4; is the first moment of the grouped-distribution in which 2; is in 
the r-th position of a class. 

It is obvious that we may form ¢,4:™ from ¢,™ by a cyclic permu- 
tation of the e’s of ¢,™. 

If Fi=F,=F3= -- - =F;,, it follows that ¢,“ =0 for all r. Therefore, 
if k can be selected such that all of the F’s are equal, then the mean of 
any grouped-distribution equals the mean of the ungrouped data. 

Also from (6) we are able to find the difference between the mean of 
any grouped-distribution and the mean of the ungrouped data without 
computing either mean. Therefore, if one desires to present the data 
in the grouping for which the mean differs least from the true mean 
then it is only necessary to select the grouping which corresponds to the 
smallest ¢,"). 

HIGHER MOMENTS 


The ¢-th moment of the ungrouped distribution is 


1 k-r 
m= TD (1 efi tO > (vi — es) fi-r41 (8) 


l=0 im? je(i—1)k+1 


which after some simplification, becomes 


1 t—1 i 
= (= Ime = 24 (= DH) De VE + (— I'd (9) 


j=l 
where V, is the sum of the j-th power of the variates which were 
‘ 


grouped in the i-th class and }> operates on the classes of the grouped- 
distribution. 
ILLUSTRATION 

We use the arbitrary distribution of Table I to illustrate the method 
of finding the moments of an ungrouped distribution from the moments 
of a grouped-distribution. We arbitrarily choose a class interval of three 
and hence the data of Table I are written in Table II in columns of 
three rows. The F’s, as defined by (3), are computed by simply adding 
the f’s of each row. Thus F; =33, F;=13, and F;=14. 

Since terms of the form }>v,'V; appear in the correction formulas, 
where V;“ represents the sum of the j-th power of the variates which 
fall in the 7-th class of the chosen grouped-distribution, we find it neces- 
sary to include columns of V,;“ in the grouped-distribution table. 
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TABLE I TABLE II 
Variate | Frequency z | z | ft | zx | f | F 
z f —— —-——— 
1 | 2 4 | 30 7 1 | 2+30+1=33 
1 2 2 8 5 4 s | 1 8+ 44+1=13 
2 8 3 | 10 6 3 9 | 1 | 10+ 3+1=14 
3 10 
4 30 
5 4 
6 3 
7 1 
8 1 
9 1 








Jones’ and Dwyer*® suggest that a column of V;“ be included but for 
accuracy in the /-th moment it is necessary to use V;‘‘~'). 

However, it is usually not necessary to find moments higher than the 
fourth and in many instances the first and second moments are suffi- 
cient. 

Although it is possible to form three groupings of three, in practice 
one obtains the moments from a single grouping. Formula (9) applies 
equally well to any grouping. Therefore, there is no loss of generality 
when we select the grouping in which the first variate is in the first 
position of the class. This grouped-distribution, together with V;“, 
j=1, 2, 3, is given in Table III. 











TABLE III 
a l 
Class | — | f vo y) | ya 
| | | 
1-3 | 2 |} 2 | 48 24 | 836 
4-6 5 37 158 688 3068 
7-9 8 3 24 194 1584 





The correction formulas through the fourth moment, as given by (9), 
are 





M1 


Ke 


3 


ba 


~ = ¢,")), 


2 
= — se + WV > vVis + ¢,, 


3 3 
= ots — — > 902V + — Vi Vi — 6, 
oe x N a , 


(10) 


(11) 


(12) 


4 6 4 
ee we _ 92), —— D 02V 52) +— + 7ViP+¢,. (13) 
N N N 


5H. L. Jones, “The Use of Grouped Measurements,” this Journax, Vol. 36 (1941), p. 529. 


*P. S. Dwyer, ibid., p. 154. 
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Before we can apply the correction formulas we must compute 
di, oi”, od, and ¢. We get 


1 19 
gi) = W fexFi: + e2F2 + e@3F3] = on’ 
gi = Ld [er?F’'1 + e2°F2 + e37F'3] = i 
N 60 
i) J [ aFy+ 3Fy+ 3Fs] 19 
= — [e e e; = — 
1 N 1 1 242 343 60. 
a” = bf [e:*F + e24F'2 + e3*F's] = = : 
N 60 
230 
We now substitute in the correction formulas and find ~—" 
1006 4988 28066 
bo =——> w3=——» wa =——-> which are identical with the mo- 
60 60 60 


ments of the ungrouped data. 
Comparisons between moments obtained by various methods may be 
made from Table IV. 























TABLE IV 
: — Moments - er i 

Dist. ma us us ma 

. | 230 | 10060 «| 4988 28066 

Ungrouped dist. | — | paca anlie siaeniien 
60 60 60 60 

: 249 1197 | 6321 35733 

Grouped dist., uncorrected = — | | —— 
60 60 } 60 60 

; 249 1057 5823 | 31465 

Grouped dist., corrected by usual formulas | — —_— —_— —_—— 
| 60 | 60 60 60 

230— | 1006 | 4988 28066 

Grouped dist., corrected by new formulas - = } mo | —— —— 
60 | 60 60 60 








LIMITS OF ERROR BETWEEN GROUPED MOMENTS 
AND UNGROUPED MOMENTS 


If the limits of error between the desired moment of the ungrouped 
data and the corresponding moment for any grouped-distribution are 
close, it may not be worth while to use the exact formulas of this paper. 

Norton’ gives formulas for finding the extreme errors in the moments 


7K. A. Norton, “Limits to the Accuracy of Estimated Moment Coefficients,” Sankhya, Vol. 3 
(1938), pp. 265-272. 
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of a continuous distribution. Formulas for the maximum difference be- 
tween yp; and ,u; for a distribution of discrete variates are derived below. 

The value of y:, denoted by u,*, for which u;—,p; is a positive maxi- 
mum occurs when the difference between v; and each variate in the class 
is the largest possible value, namely e:, and positive. In other words, 
Ht— re iS & pOsitive maximum when the variates of the classes are all at 
the upper limits of the classes. Therefore, 


— Zz (v5 +e) fi 
Dh 


Similarly, the value of u,, denoted by u;-, for which u;—,; is a negative 
maximum is given by 
— Li@-afi 


Ly > fi 


we S we S pet. 


t t t/t 
Y~(- »( ats Sus ie (; Yaueser (14) 


i=0 Vv 








and necessarily 


Therefore 


which, for the first two moments, becomes 


ria — 4) Sn S oi + 41, (15) 
rg — Zyeier + 1? S we S poe + Qyrer + 1”. (16) 


It follows that (14) may be used to determine the maximum devia 
tion of a given moment of the grouped-distribution from the corre- 
sponding moment of the ungrouped data. 


CONCLUSION 


If precision is not desired or if the limits of error between ,y,; and yy, 
are close, the conventional correction formulas (1) may prove satisfac- 
tory. But if additional precision is needed, the new correction formulas 
of this paper should be used. In order that these new formulas may be 
used it is not sufficient to record the results of grouping in the form of 
classes and frequencies The grouped-distribution should also contain 
F,, Fs, - - - , F, and columns of V;™. Usually it will not be necessary to 
go beyond V;. This additional information requires only a little more 
space and labor yet it makes it possible for others who compute mo- 
ments from the grouped-distribution to do so confident that their re- 
sults are accurate. It is to be hoped that this procedure will be followed 
by those who prepare grouped-distributions for general use. 








A SECOND MOMENT CORRECTION FOR GROUPING 


By G. R. Davies anp Nancy BRUNER 
University of Iowa 


rection in estimating the population variance of grouped sample 
data sometimes results in an unreasonably low figure, particularly 
when the sample distribution lacks “high contact,” that is, when the 
first and last frequencies are relatively large. This downward bias ob- 
viously arises from the fact that the correction theoretically assumes 
and corrects for classes from minus infinity to plus infinity, as in the 
case of the normal curve. 

Several attempts have been made to approximate and remove this 
bias, but none of the formulas proposed has come into general use.! 
This failure arises in part from the difficulty encountered in devising a 
smoothing procedure which is at once as plausible as Sheppard’s, and 
at the same time is convenient to use. An even greater difficulty is that 
of adapting the correction to a discrete scale having smaller class inter- 
vals than those of the given distribution, as is desirable when a measure 
like “persons per dwelling,” grouped to include a range of several per- 
sons, is to be reduced to discrete unit classes. 

It is the purpose of this paper to develop a correction which will be 
identical with Sheppard’s at the limiting case of a continuous scale and 
high contact, and yet will be adaptable either to an integral number of 
equally wide sub-classes, or to a continuous scale. At the same time the 
correction must not be unduly influenced by irregularities in the data, 
outside of the first and last frequencies, which necessarily measure lack 
of high contact. 

It will be found convenient to approach the problem by observing 
that Sheppard’s second moment correction in effect compensates for 
two distinct errors arising from grouping. The first of these two errors 
is the result of considering the squared class mark as representative of 
the average of corresponding squares in the subclasses. Thus a fre- 
quency of f, with class limits of 0 and 8, is usually assumed to con- 
tribute to the calculation of the second moment fm?=fX4?=16f. But 
if circumstances require that it should be broken down into two equal 
subclasses of one-half the original width, it contributes f(2*+6*)/2 
=20f, or, similarly, if into four subclasses, f(1°?+3?+5?+7?)/4=2lf. 

1 W. T. Lewis in “A Reconsideration of Sheppard's Corrections,” Annals of Mathematical Statistics’ 


Vol. 6, 1935, has derived a complex second moment correction somewhat similar in principle to that here 
developed. Also see the September, 1940, issue, pp. 324-332. 
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[’ IS A MATTER Of practical experience that the use of Sheppard’s cor- 
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The sum of squares, corrected for the grouping error thus illustrated, is? 


fm. = fm? Pee toh = 
12 k? 





(1) 


where f is the frequency, m is the class mark, 7 is the class interval, k is 
the integral number of subclasses in one class, and m,, is the class mark 
corrected for squares, that is, adjusted for subclasses as just illustrated. 
It is obvious that if the scale is continuous and k is assumed to be infi- 
nite the fraction (k—1)/k? disappears, and, in the above illustration, 
~ 2 82 
Sma = fm? +; — = 16f+ oa? = 21.33/. 


If the class intervals of a given distribution have the same width 
throughout, and if each class is broken down into the same number of 
subclasses, the aggregate correction for the whole distribution, with re- 
spect only to the squared class marks, is 

2 2 am 
Lo fm. = Dfm +N . (2) 
12 ik? 

It may be noted that if k of this formula is again assumed to be infi- 
nite, that is, if (k?—1)/k? is taken at its limit of unity, the correction 
term follows Sheppard’s second moment correction, except that the 
sign is positive, suggesting that for continuous, high-contact distribu- 
tions another correction, twice as great and negative, is required. The 
nature of this second correction will now be considered. 

The second correction for grouping inakes allowance for the error in- 
volved in the usual assumption that in each class of a distribution the 
frequencies are distributed equally throughout the class, instead of be- 
ing somewhat more numerous on the side of the class nearest to the 
mode. This may be called a correction for slope or height, inasmuch as 
graphically, on a continuous scale, it changes the flat top of the fre- 
quency rectangle into a curve approximating the smoother outline of 
the whole distribution. It will be seen that this correction is made sepa- 


? This formula is readily obtained as follows: When & is even, 
f i\? 2 i(k —1)\* 
L(= + 5x) +(™ . x) + +(n + Ok ) 
ae a, eee Oey 
——_—- 6h US a 


£ km? > a 32 + 52 + L (k — 1) : eS -». 
“7 wet +i +5 + (k — 1)?) = fm 3 


A similar arrangement for k odd gives the same result. 








-A Seconp Moment CORRECTION FOR GROUPING 65 


rately in each frequency, which distinguishes the procedure from the 
more radical one of curve fitting. 

Stated more precisely, the second correction applied to a given fre- 
quency involves a readjustment of the graphic height of each of the k 
subfrequencies in such a way that they conform to the outline of the 
distribution and yet retain in the aggregate the original area of the 
frequency rectangle. The plotted subfrequencies in the accompanying 


CHART I 
CLASSIFICATION OF CERTAIN SHOPS ACCORDING TO NUMBER OF 
MECHANICS EMPLOYED (SEE TABLE I) 


Frequencies reported in classes of three, and reduced to unit classes by parabolic interpolation. 
Plotted subfrequencies, read on Y scale, should be divided by k =3. 
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chart illustrate the nature of the adjustment. This graduation of the 
subfrequencies may be most conveniently accomplished by making 
them conform to certain parabolas about to be described, inasmuch as a 
normal or logarithmic normal curve could be closely approximated by 
a series of suitable short parabolas. Such parabolas may be constructed, 
one within each class, to form a connected series beginning and ending 
with zero height (Y=0) at the limits of the distribution, each two ad- 
jacent parabolas meeting at the average height of the two frequencies 
they respectively represent. Each rectangular subclass is then given the 
same area as that under the portion of the parabola having the same X 
limits. The height of the parabola may then be adjusted to pass through 
the central ordinates of the given class subfrequencies at the graphic 
height of each f,. 

Any one of the parabolas thus adjusted may be defined in terms of 
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the given frequency (f) of its own class, the preceding frequency (f_:), 
and the following frequency (f4:) by the equation 
W-faw~-fsaP—1  farfs 3(fxr + fir — 2f) , 


y’ = - ad 3 
8 oe a 2i2 oe 








where « has its origin at the class mark of the given frequency, and the 

“base” (Y’=0) of the parabola is f. If k is infinite, the integral of Y’dz 

between the class limits r= —7/2 and x= +7/2 will then equal zero, as 

required. 

Multiplying each squared subclass mark (m,?) in a given class by its 
subfrequency (Y’,/k), summing and simplifying, give the total, 

D ¥.'mz/k = f = (2 $< ~<*) cet 

e M,° = IMA = m = —. — 

: k? 12 k* 








This total embodies the correction for slope (graphic heights of subfre- 
quencies) and for the class-mark squares in so far as they enter into the 
modification of height. The two corrected total squares, (1) and (4), 
may now be combined, and the frequency values of b and ¢, as indicated 
in (3), substituted. The result is a corrected total of squares for any 
class except the two at the extremes of the distribution, as follows: 


m 2f—fi—fis |< ¢? k?—1 
Ne? = 2 — —-—— ( = a . . 
fm? =fm +1 ; fa-J+1) = = in Be 





(5) 


While a formula for the total corrected squares might be obtained more 
directly by expanding >> Y, m,?/k, where Y, expresses the full graphic 
height of the subfrequencies, the method here adopted, involving a 
separate expression for each of the two errors, leads directly to a form 
easily adaptable to further manipulation. 

In totaling the squared class marks for all classes, the frequencies 
may be successively denoted by fa, fs, f- - - - fz. These symbols may be 
substituted for f_; and f,; in the appropriate classes. In the first class it 
will be necessary to take f_; as —f,, and in the last class f,; as —f,, in 
order that the series of class parabolas may meet the condition that 
they should begin and end at Y =0. The total for all classes may then 
be reduced to the form, 


4 
fm? = >-fm -[¥ ~- (fa + fe) 


(6) 








k? + 4 i? k? — 1 


em S12 


If the scale is considered to be continuous, and if subclasses are not ap- 
propriate, this formula obviously becomes 
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2 


4 
DLfm2 = Difm* - lv =? i. (7) 


12 
And if high contact is assumed, f, and f, disappear and the formula is in 
agreement with Sheppard’s correction. 


TABLE I 


CLASSIFICATION OF CERTAIN SHOPS ACCORDING TO NUMBER OF 
MECHANICS EMPLOYED (i=3), WITH DIRECT COMPUTATION 
OF SUBFREQUENCIES IN UNIT CLASSES (k =3), AS DETER- 
MINED BY PARABOLIC INTERPOLATION* 








Class k Xsub- Subclass Summed 








Class mark Frequency Parameters of frequency mark squares 
™ f parabola kf, Ms kigm,? 
{a= 22.6 9.3 1 9.3 
1- 3 2 20 b= 9.3 22.6 2 90.6 
c=—4.0 28.0 3 252.0 
a= 39.1 33.7 4 540.4 
4- 6 5 36 {b= 0.6 39.1 5 977.7 
\ c= —4.6 35.1 6 1264.0 
a= 24.0 28.0 7 1372.0 
7-9 8 24 { b=—4.0 24.0 8 1536.0 
c= 0.0 20.0 9 1620.0 
(a= 11.3 15.3 10 1533.3 
10-12 11 12 { b=—3.0 11.3 11 1371.3 
c= 1.0 9.3 12 1344.0 
a= 5.7 fe 4 13 1314.4 
13-15 14 6 { b=—1.6 5.7 14 1132.4 
c= 0.3 4.4 15 1000.0 
a= 2.0 3.3 16 853.3 
16-18 17 2 b=—1.3 2.0 17 578.0 
| e= 0.0 0.6 18 216.0 
3)300.0 3)17005.1 
N =100 N =100.0 Lfem,? = 5668 .370 





Graphic height of subfrequencies (x =m, —m) 


2f —fir —Sus kt—1 Sus —fiu 3(fa +fii-2f) 
———* pe 


8 kt 2 23? 


¥,=kf, =/+ 








Lim = Xfm, =662; M =662/100 =6.62 


B+17e B-1 
em J12 #& 





4 
Zfme = Tfm*— [ ¥- 7 +s) 


4 1079 8 
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Since it may be shown that >-fm requires no correction for slope, the 
estimated population variance is simply 


o¢ = a |x fm? — (do fm)*/N (8) 
N-1 

In order to illustrate the theoretical steps implied in parabolic cor- 
rection for grouping, the accompanying table is presented, and the data 
and computed subclasses are charted. The data of the table may be 
assumed to represent a sample of 100 smal! shops classified according 
to the number of mechanics employed. The original data are grouped 
in classes of three; i.e., shops having 1 to 3 mechanics, 4 to 6 mechanics, 
etc. An estimate of variance is required, based on three subclasses in 
each class. It will be noted that the result obtained by the tedious esti- 
mate of probable subfrequencies and the totaling of squares of sub- 
class marks, duplicates that obtained by the derived correction 
formula. 

Very irregular distributions, such as bimodal, if similarly treated, 
would not always present a plausible smoothing in detail. In particular, 
when an extreme frequency (f, or f,) is relatively very small, parabolic 
interpolation may not be literally plausible, since a portion of Y in the 
class may be negative. But the use of the parabolic residual, f.+f., in 
the correction formula, is plausible, since relatively small extreme fre- 
quencies, as in the case of the binomial (0.5+0.5)" with n large, imply 
an approach to the high contact which Sheppard’s correction requires. 








PRACTICAL APPLICATIONS OF THEORETICAL 
SAMPLING METHODS* 


By Joun N. Wess, M. Starr Norturop, 
AND STANLEY L. Payne 
Bureau of the Census 


N A RECENT popular novel, The Just and the Unjust, two attorneys 
| are discussing the difference between the theory and the practice of 
law. One of the attorneys draws the distinction in terms that assume 
an extremely high level of professional competence: “Theory,” he says, 
“is where you want to go; practice is how you are going to get there.” 
The other attorney, denying that a high level of professional compe- 
tence can be assumed, states the distinction in these terms: “Theory is 
what you tell people you are going to do; and practice is what they 
catch you really doing.” 

In this paper on some practical applications of sampling methods, it 
would be pleasant to take the first distinction between theory and prac- 
tice as applying to statisticians as well as to attorneys. Were this as 
possible as it is desirable, we could take leave of sampling theory at this 
point with the remark that it provides us with the rules and the blue- 
prints of our operations. 

But some of you must have wondered from time to time whether 
statistical surveys in the service of marketing, or government, or opin- 
ion measurement, to mention only a few, can actually live up to the 
specifications laid down by sampling theory and still get the job done 
on time and within the limitations of field resources. When, for instance, 
you see a statement in a commercial report, a newspaper, or a mono- 
graph that the findings presented therein to one or two decimal places 
have an accuracy such that 95 or 99 times out of 100, the findings of the 
sample survey would not differ from the actual facts by more than 
some extremely small value, have you not sometimes marvelled at the 
high precision we statisticians have achieved in recent years? 

Sampling theory does provide us with simple rules by which we can 
compute the reliability of our findings, provided certain procedures 
were actually carried out; and there is no lack of zeal in following theory 
on this point. The conditions under which the computation of reliability 
is valid are by no means easy to realize, particularly in sample sur- 
veys of social and economic phenomena. Moreover, practical adjust- 
ments in terms of field needs and dead-lines often complicate the prob- 


* This paper was prepared for presentation at the 104th Annual Meeting of the American Statistical 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 
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lem of measuring the precision of the results, so that it is possible to 
make only broad statements about the reliability of the results. 

Let us take, for example, the apparently simple requirement that 
every observation in the universe to be sampled must have an equal 
chance of being selected. Now, if our observations are numbered cap- 
sules in a selective service drawing, or cards in a file drawer, or colored 
marbles in an urn, this condition should be relatively easy to fulfill, 
although even then there may be unexpected difficulties. But if our 
universe consists of vacant dwelling units, or employed or unemployed 
workers, or passengers boarding intercity trains and busses, how are we 
to determine who or what makes up our total universe so that we can 
insure every item an equal chance of inclusion in our sample? To list 
all dwelling units in a community as a preliminary step to sampling is 
obviously ridiculous, especially when the listing would cost more than 
is allowed for the entire survey. Equally, to make a card index of all 
workers so that we may sample proportionately the employed and the 
unemployed is to take a census in order to take a sample. 

Thus, the rules of sampling, however easy or difficult to devise when 
the universe is ordered and static, become extremely difficult of realiza- 
tion when the universe is disordered and dynamic. In many cases the 
theoretical conditions of sampling cannot all be met within the limits 
of time and cost, and resort must be had to procedures that we believe 
approximate the conditions desired. How far the sampler may safely 
depart from strict conformance to established rules is a matter for his 
competence and conscience to decide, because there is seldom any abso- 
lute check on the findings. 

The sampler of social and economic data must improvise procedures 
from his knowledge of methods used in other fields and frequently 
must pioneer in developing new means of insuring sufficient reliability 
of results to meet administrative needs. Under the urgencies and incon- 
veniences of the war program, he faces more and more difficult operat- 
ing problems but by the use of ingenious sampling devices he continues 
to produce the necessary information. More than ever he must take 
practical considerations into account in devising methods without over- 
looking the necessary theoretical considerations. The concrete examples 
that follow show how theory and practice are brought together in two 
types of sample field surveys now in operation. 


VACANCY SURVEYS 


The urgent need for current information on residential housing for 
in-migrant workers in war industries provides the first illustration. 
Consider a small city near some war activity—a city in which the War 
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Production Board, the National Housing Agency and the War Man- 
power Commission are concerned about the amount of housing avail- 
able for an expected influx of war workers and their families. Let us 
assume that activity is well under way and that already owners of 
residential property have found it their profitable and patriotic duty 
to convert large one-family structures into rebuilt apartments. Other 
landlords are breaking up their small apartments into separate rooms 
for rent. Thus, at the same time and for the same reasons, new small 
family dwelling units are being created from large residences and other 
small family dwelling units are being eliminated. It is difficult to de- 
termine in advance whether the family dwelling unit inventory is 
expanding or contracting under these circumstances. 

Current information on dwelling unit vacancies is urgently needed in 
order that the city may be allowed its necessary number of public 
housing projects and that its private builders may receive their share 
of materials for new construction. The sampler is asked to design a 
method of obtaining the vacancy rate with a margin of error not too 
great for the purposes of the agencies administering the construction 
program. As usual, the findings are needed “day before yesterday” but 
they will still be of some value, let us say, four weeks from now. The 
agency requesting the data feels justified in spending less than one-fifth 
of the cost of one dwelling unit on this one survey. Furthermore, it 
reasonably expects the findings in this city to be comparable with those 
in more than 100 other cities where similar surveys are to be made, in 
order that the supply of housing materials may be distributed judi- 
ciously. Such are the conditions that face the sampler when he ventures 
to design the sampling method. Such conditions are not set forth in any 
elementary textbook nor in this particular combination even in ad- 
vanced books on sampling theory. Since the sampling method he em- 
ploys has never appeared in print it is up to the sampler to develop his 
own theory. 

Let us suppose that the city in question is known to have some 12,000 
dwelling units, that the expected vacancy rate is in the neighborhood 
of 2 per cent of all the units, and that the sampler is allowed a standard 
error of approximately 10 per cent of the reported vacancy rate. Refer- 
ence to elementary probability theory will tell him that he could take 
arandom sample of 3,500 units or about 3 in every 10 dwelling units in 
this city, but the theory does not indicate how randomness of selection 
is to be assured. An obvious device would be to select 3 in every 10 
addresses from an up-to-date city directory. But even an excellent city 
directory cannot be kept up-to-date under wartime conditions; and if 
such a directory were available this device might still be impractical 
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from the cost angle. Were it not for the time, the cost, and the problem 
of dwindling manpower, the sampler could make his own complete and 
current inventory from which the sample might readily be selected. 
Faced with the realistic problem of providing the information in less 
than a month at a cost of somewhat under $1,000 and having no current 
city directory in which he places reliance, our sampler instructs his 
local supervisor as follows: 

He is to obtain the most recent map of the city and bring it up to 
date by conferring with the city engineer on new streets which have 
been cut through and boundaries which have been changed. He is then 
to subdivide the map into areas as homogeneous as possible, using as 
a first approximation Census tracts, wards, school districts or other 
recognized local divisions. Then he calls upon one of the city fathers to 
assist him in further subdividing these areas into white and Negro 
neighborhoods, the rooming house section and other important types 
of area. If the supervisor had the time, he might make a scientific 
stratification using the Real Property Inventory, for example, but the 
sampler explains that the old resident’s idea will serve the purpose 
almost as well. The supervisor is then to number seriatim all the blocks 
within each sub-area, carrying over the numbering from one sub-area 
to the next. Finally, he is to select the sample blocks by simply taking 
every other one of the numbered blocks. This sampling ratio will, of 
course, give a larger number of dwelling units than would have been 
covered in the hypothetical 3 in 10 random sample prescribed by the 
textbook; but more important, it gives a smaller number than would 
have been included in a complete inventory of the city. 

Next, the local supervisor proceeds to have his enumerators make an 
inventory of dwelling units in these sample blocks, ringing each door- 
bell, and investigating all alleys and passageways in their search for all 
units whether in converted attics, in basements, or over garages. As 
the enumerator makes the inventory of a given block, he lists the perti- 
nent information for each unit which he discovers to be vacant (an 
average of only one vacant unit in every 50 according to the original 
assumption), listing the number of rooms, the rental asked, the condi- 
tion of the structure and the facilities included in the dwelling unit. 
Thus, when he has completed his inventory of a sample block, he has 
also completed the count and description of the vacant units within 
that block, at little more than the cost of a simple listing of addresses. 

Experience has taught the sampler that the block sample, which is an 
areal sample drawn from a horizontal plane, might, by chance inclusion 
of too many large apartments rising above the horizontal plane, seri- 
ously increase the apparent vacancy rate of the sample over the true 
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vacancy rate. This is especially true of rates for rental vacancies since 
the large apartments frequently have high turnover of tenants and 
because all the apartment units are rental units. To overcome this dif- 
ficulty, the local supervisor is asked to exclude the large apartments 
from the block sample and to investigate the apartments in all large 
apartment buildings whether they fall in the block sample or not. This 
is relatively easy because some one person in the apartment house, the 
manager or the janitor, can usually give all the necessary information 
on the vacant units. Results for the block sample are weighted up by 
the sampling ratio and results for the large apartment structures are 
added in to give the combined vacancy rate for the city. In this way the 
administrative conditions are met within the bounds of sound theory. 
Each block has an equal chance of representation in the sample; densely 
populated and sparsely populated areas are represented in the same 
proportions. 

The important decision in using the block sample method is the esti- 
mate of the efficiency of the block as a sample unit in comparison with 
that of a random sample of addresses. This estimate is difficult to make 
for the first survey in a given city, but the approximate number of 
blocks, the number of dwelling units, and the April 1940 vacancy rate 
are helpful in typing the city for this purpose. For use in later surveys 
of this city investigation of the results of the first survey by blocks gives 
an excellent estimate of the efficiency of the block sample. A great boon 
to the sampler in this connection are the tabulations of block statistics 
from the 1940 Census now available for all cities over 50,000 popula- 
tion. Not only is it now possible to use a locational stratification but also 
to employ other means of stratification, such as race of occupants, age 
of structure, and average monthly rental. 


LABOR MARKET SURVEYS 


The sampler’s problems are not limited to designing plans for housing 
surveys alone. In this period when the Nation needs every available 
man and woman to serve on either the front line or on the production 
line, timely statistics on manpower are needed on a current basis for 
use in planning the war effort and in relieving labor shortages. How 
many civilians are now working? At what occupations and industries? 
Can any of these be shifted to more important jobs? What is the size 
of the unemployed labor reserve? Who are the unemployed and what 
can they do? Is the present labor force large enough to do the job ahead, 
and if not, what is the labor reserve of nonworkers who can and will take 
jobs if the need arises? What are the work experience and skills of 
persons in this labor reserve; will they need training and what kind? 
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If married women come into the labor market, will day nurseries have 
to be established? 

Some or all of these questions require immediate answers for the 
guidance of persons dealing with the manpower problem. Results may 
be required for a given city, a county, a local labor market area, a State, 
a geographic region or the Nation as a whole. A complete census of the 
United States could provide answers to all of these questions for all 
types and combinations of geographic areas, but as in the case of the 
housing surveys the results are wanted day before yesterday. Informa- 
tion about employed workers might be obtained by a canvass or ques- 
tionnaire directed to all employers or a sample of employers. 

At this point, however, the sampler begins to ponder the problem of 
establishing his universe of all corner grocers who hire help, of all 
housewives who hire servants, of all home owners who hire the neighbor 
boy to mow the lawn, and so on. Even if he solves these problems, and 
he probably could, he immediately realizes that he would not be able to 
answer all of the questions raised above. The employer cannot tell how 
many unemployed there are in his community, nor can he furnish any 
information on how many housewives or high school students would 
take full-time jobs in industry. It soon becomes apparent that, to be 
complete, a labor force study must include information on all types of 
persons—employed, unemployed, and nonworkers. As mentioned above 
employers’ records would provide at least a partial listing of employed 
workers, but there is no such list of the unemployed in any city, county 
or State. Work relief agencies and employment offices could provide 
partial lists of the unemployed, but these would be incomplete. Outside 
of school records, there are practically no listings of nonworkers. 

At this point the sampler has ruled out the complete census as being 
too costly and time-consuming, but he has also discovered that there 
are no tailor-made lists of his employed, unemployed, and nonworker 
universes from which he can sample. As was pointed out previously, 
to prepare complete lists or card indexes would be too costly and time- 
consuming. Regardless of whether the study is to cover a city, county, 
State, or the whole country, the required data must be obtained from 
a direct enumeration of a cross-section of the population within the 
primary sampling unit. The “Monthly Report on the Labor Force” 
survey, conducted each month by the Bureau of the Census,! covers a 
national sample in which the primary sampling units are counties and 
the secondary sampling units are households (dwelling units). The 
procedures followed in selecting the subsample of households within 


1 See Sampling Procedures and Method of Operation of the Monthly Report on the Labor Force, 
U. 8. Department of Commerce, Bureau of the Census, November 1942. 
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these counties serve to illustrate some of the practical approaches to 
problems of sampling for labor market data. 

The sampler has worked out a carefully designed stratified sample of 
counties. He has been informed of the phenomenon that is to be meas- 
ured; and taking into account funds available, field operating problems, 
and estimates of the sampling error, he has determined the total number 
of observations to be made. The total number of observations are 
broken down into county quotas. In turn, the quota for each county is 
assigned to open country areas, cities, towns, etc., in proportion to the 
total population of the county residing in those places.* So far, so good 
—all of the preliminary work has been painstaking and very important, 
but it has all been done by an “expert IN Washington.” Now comes the 
job of deciding how a local supervisor and finally an enumerator out in 
county X will locate and interview a sample of, say, 300 households that 
will represent all types of situations according to the detailed quotas 
established for that county. Households must be sampled from among 
those prosperous farms at the northern part of the county; the poorer 
farmers in the southern end must also be represented; quite a large 
quota is in “the city”; those five or six small incorporated towns must 
be covered; and finally the unincorporated towns—‘“the wide places in 
the road”—cannot be overlooked. 

In cities and towns for which maps can be obtained the local super- 
visor proceeds as follows: 


1. An accurate map of the city or town is obtained, checked, and 
numbered in the same manner as outlined above for the housing 
survey. 

2. A block quota is obtained by dividing by three the total number 
of households to be visited. This means that on the average three 
calls will be made in each block, a consideration that is important 
in cutting down travel time and cost, and also in saving gasoline 
and tires. 

3. The block quota is then divided by the total number of blocks in 
the city. If this comes out to 20, then every 20th block in the city 
is designated as a sample block. This guarantees geographic 
coverage of all sections of the city. 

4. Block descriptions, in terms of street boundaries, are then tran- 
scribed from the map to block lists. 

* Another plan for assigning a quota within the primary sampling unit (e.g., county) of a stratified 
sample is to make allocations in such a way that the observations will represent the population of the 
entire stratum from which the primary sampling unit was selected, rather than the population of the 
primary unit alone. To cite a simple example, assume county X is selected to represent a given stratum, 
and that 20 per cent of the population of this county lives on farms. Assume further that for the total 


population of all counties in this stratum, 25 per cent of the population lives on farms. Then for this 
county 25 per cent, rather than 20 per cent, of the county quota would be assigned to farm households. 
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5. Enumerators are sent to each sample block, and on a separate 
line of the block list form they record the address of every dwelling 
unit in the block, whether occupied or vacant. 

6. When all block lists are complete and checked, a count is made of 

the totai number of dwelling units in the total of all sample blocks. 

This count is then divided by the number of dwelling units to be 

sampled in this city to provide a dwelling unit ratio. 

8. The dwelling unit ratio is then applied to the block listings of 
addresses. If the dwelling unit ratio is 10, then the 10th, 20th, 
30th, 40th, etc., dwelling units listed become the households to 
be included in the sample. By this device, blocks that contain 
only a small number of dwelling units may have possibly one or 
no addresses included in the sample. On the other hand, congested 
blocks in tenement or apartment house districts may have 10 or 
15 addresses designated for the sample. 


a 


In small towns and unincorporated places for which no maps can be 
obtained the local supervisor proceeds on the basis of one of the follow- 
ing alternatives: 


1. He draws a rough draft map showing street names and other land 
marks, and proceeds as outlined above; or 

2. He omits steps 1 through 4 as outlined above, has his enumerators 
block list all blocks in the town; and proceeds with steps 5 
through 8. 


It should be noted that the block lists as described above should be 
carefully filed after the sample is drawn. These provide an immediate 
source of additional addresses from which other samples can be drawn 
without further expense. In general, however, block lists that are a 
year or more old should be brought up to date by a canvass of the 
blocks so that newly constructed units can be added and eventually 
sampled. 

For open country areas, including very small unincorporated places 
or cross-road clusters of dwelling units, the sampler proceeds as follows: 


1. The total open country quota for the county is divided up (in 
units of 5) between townships or similar minor civil divisions in 
proportion to the open country population living in each. 

2. A county map showing roads and highways is obtained. If the 
pattern of roads does not already break up the map into mile 
square sections, it is ruled in pencil vertically and horizontally at 
1 mile intervals to accomplish this same result. 

3. Within each township these sections are numbered consecutively. 

4. For every 5 units in the quota for a given township, 3 sections are 
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designated for inclusion in the sample by taking every nth section. 
For each section so designated, a starting point is marked arbi- 
trarily with an X. 

5. At this stage, the map is sent to the field for use by the local super- 
visor. 

6. To determine the open country sample for a certain township in 
the county, the supervisor proceeds to the lowest numbered sec- 
tion on the map, locates the starting point, and traveling around 
the section in a counter clockwise direction, writes a descriptive 
address for each of the first five dwelling units (either side of the 
road) that he comes to. If a section does not provide 5 households, 
he proceeds to the section with the next higher number, etc., until 
the quota for the township is reached. 

The procedures illustrated immediately above for labor market 
surveys are based on the assumption that fixed schedule quotas can 
be assigned to the primary sampling units. This is usually possible 
where current population figures are known. At the present time, how- 
ever, population migration as a result of war and other conditions has 
become so pronounced in certain areas that the sampler will be hesitant 
to establish fixed quotas on the basis of population figures that do not 
reflect recent increases or decreases. This problem may be met by 
establishing fixed sampling ratios rather than quotas. For example, a 
town having 1,000 households in 1940 might warrant a sample of 50 
households or 1 in 20. Thus the quota might be established by this rule: 
take every other block and every 10th dwelling unit in each sample 
block. If there has been no population change the sample should be 
close to expectations, i.e., 50 households, but if the population has 
doubled since 1940 this ratio quota should provide a sample of 100 
households. Ratio quotas can be adapted to the techniques described 
above with very few changes. 

The actual application of sampling designs as described above for 
housing and labor market surveys illustrates only a few of the prob- 
lems that must be faced in connection with the planning and operation 
of sample surveys. The sampler must foresee and take account of 
practical field problems in arriving at a sampling plan. Regardless of 
how nearly perfect a sampling design is from a theoretical standpoint, 
it must be discarded if the field staff cannot take over actual operation 
on a practical basis, and within necessary time and cost limits. Both 
theoretical and practical considerations must be given full weight; 
there will often be compromises between the two, but neither should be 
disregarded in developing a sound theoretical sampling plan that can 
be put into operation on an efficient basis by nontechnical field super- 
visors and enumerators. 











SOME SAMPLING USES OF DATA FROM THE CENSUS 
OF AGRICULTURE* 


By Irvin Hotmest 
Bureau of the Census 


HE 1930 census of Agriculture provided, among other things, a 
"T ‘ctacsification of farms into 13 major types and 5 subtypes. This 
classification by type of farm was based primarily upon the value of 
farm products sold, traded, or used by the operator’s family. It was 
used for making type-of-farm tabulations by counties and states of 
many agricultural items, such as acreage of land in farms, value of 
farm land and buildings, receipts from boarders, lodgers, etc., numbers 
of specified kinds of livestock and livestock products, numbers of farms 
by size, numbers of farms by value of products, numbers of farms by 
tenure of operator, and expenditures for specified items, as well as the 
value of farm products. As a result of the experience gained in this 
work, changes were made in the county and state tabulations of value 
of farm products data for the 1940 Agriculture Census as follows: The 
number of questions on value of farm products was increased from 6 to 
10 to provide more detail regarding sources of income; classification of 
farms by major (single largest) source of income was substituted for 
classification by type in order to provide a more objective basis for 
coding and, thereby, reduce editing costs; and the value-of-farm-prod- 
ucts data were tabulated for farms classified by total value of farm 
products in order to show the principal sources of income of farms at 
different gross farm income levels. 

County and state tabulations were not made for farms classified ac- 
cording to major source of income, or according to total value of farm 
products, for the additional agricultural items mentioned earlier in con- 
nection with the 1930 Agriculture Census tabulations. Instead it was 
decided to use sample data as a basis for presenting statistics on the 
characteristics of farms classified according to total value of farm prod- 
ucts and according to type (major source of income). Four considera- 
tions contributed mainly to this decision: The urgent need for quickly 
available figures in connection with war programs, particularly statis- 
tics on marginal (low income) farms and on farm labor; the necessity 
for holding costs to a minimum; the greater variety of tabulations that 

* This paper was prepared for presentation at the 104th Annual Meeting of the American Statistica 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 


t The author acknowledges the valued assistance rendered by Morris H. Hansen, William N. 
Hurwitz, and Hilton E. Robison of the Bureau of the Census. 
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could be made on a sample basis than on a complete basis with the funds 
available; and the possibility of securing experience and information 
which would be valuable in preparing plans for future censuses, either 
on a complete or on a sample basis. The purpose of this paper is to re- 
view the work that has been done to date on these tabulations of sample 
data from the 1940 Census of Agriculture. 


SAMPLE DESIGN AND TYPE OF SAMPLING UNIT 


The primary objective of these tabulations of sample data was to 
secure national statistics on the characteristics of farms classified ac- 
cording to total value of farm products, according to major source of 
income, and according to color and tenure of operator. The secondary 
objective was to provide measures of the variation in these statistics 
by large geographic regions. All of the data for one farm for a group of 
related items, such as value of farm products, are on a single punch 
card. Consequently, it was feasible to use the smallest available sam- 
pling unit, that is, the individual farm. Most previous studies of sampling 
techniques for proposed sample censuses of agriculture have centered 
about the problem of size of sampling unit; i.e., the problem of balanc- 
ing enumeration difficulties and costs for small sampling units, such as 
individual farms, against the decrease in precision resulting from the 
use of larger sampling units, such as clusters of farms. 

The number of farms, according to the 1940 Agriculture Census, 
varies from 135,190 for the New England Division to 1,090,574 for the 
West North Central Division. It was recognized that, under these 
conditions and with the objectives as stated, it would be highly de- 
sirable to vary the sampling ratio of number of farms for each geo- 
graphic area for which statistics were desired. Unfortunately, adminis- 
trative considerations made this procedure impractical. It was also 
recognized that for tabulations of farms classified by major source of 
income, the sampling ratio should be varied by major-source (type) 
groups. Likewise for any tabulations of farms classified by total value 
of farm products, as well as any tabulations by other sorts which in- 
volved value-of-farm-products data, it would be desirable to vary the 
sampling ratio of number of farms by value of farm products groups. 
The same administrative limitations prevailed here; consequently, the 
following procedure was substituted: 

The chief interest was in tabulations involving farms classified by 
total value of farm products. The farms with $10,000 and over total 
value of farm products represented only 1 per cent of all farms, but 
accounted for approximately 17 per cent of the gross farm income for 
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the United States in 1939. Accordingly, the cards for all farms with 
$10,000 and over total value of farm products were sorted out and 
tabulated completely.! 

The 1940 Farm and Ranch Schedules had been numbered by machine 
consecutively within each county prior to the time that the data were 
placed on punch cards. Before numbering, the schedules were arranged 
by minor civil divisions within each county. For the most part, the 
schedules were in order by enumeration districts within each minor 
civil division; and to a considerable extent, the schedules for each enu- 
meration district remained in the original order of enumeration. The 
punch cards for the sample farms were selected by machine by sorting 
for each county all cards with serial numbers ending in 15 and 65. The 
same terminal digit was selected to reduce the number of cards 
handled in the sorting. The use of these serial numbers excluded 22 
counties from the sample and resulted in an oversampling of the 
counties with small numbers of farms, particularly those with less than 
approximately 100 farms each. If serial numbers ending in 25 and 75 
had been used, the oversampling of the small counties would have been 
reduced, but the number of counties excluded from the sample would 
have been increased to 28. For a national and a regional sample, neither 
of the above defects appears important. At the time of the 1940 Census 
of Agriculture, there were only 64 counties with less than 100 farms 
each, or approximately 2 per cent of the total number of counties in 
the United States. Furthermore, of the 22 counties excluded from the 
sample, 10 were independent cities in Virginia. 

The sample drawn has, therefore, a primary stratification into 2 
classes by total value of farm products per farm in 1939. For farms with 
$10,000 and over total value of products, a 100 per cent sample of farms 
was taken. For farms with less than $10,000 total value of farm prod- 
ucts, a 2 per cent sample of farms was selected. The secondary stratifi- 
cation for the latter group was geographic, that is by counties and, to 
a considerable extent, by minor civil divisions within each county. 
Furthermore, to the extent that the original order of enumeration was 
preserved in numbering the schedules, the 2 per cent sample of the low 
income farms approximates a selection along the enumerator’s route 
of every 50th farm in that category. 


SCOPE OF TABULATIONS 
At the close of 1942, the following tabulations on a sample basis were 
completed or in progress: 


1 Actually, sample cards were also selected for farms with $10,000 and over total value of farm 
products. For certain tabulations, these sample cards have been used instead of the complete tabulations 
because of other considerations. 
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(1) Tabulations of numbers of farm laborers, expenditures for farm 
labor and other specified items, acreage of land in farms, value of farm 
land and buildings, work off farm, age of operator, year of occupancy, 
farm machinery and facilities, cooperative buying and selling, and 
numbers of specified kinds of livestock for farms classified by total value 
of farm products, 1939. 

(2) Tabulations of acreage of land in farms, value of land and build- 
ings, farm machinery and facilities, work off farm, and specified farm 
expenditures for farms classified by major source of income, 1939. 

(3) Tabulations of value of farm products sold, traded, or used by 
farm households, 1939, by sources of income; and total value of farm 
products by value groups and by major-source groups for farms classi- 
fied by color and tenure of farm operator. 

The tabulations described under (1) were being made in cooperation 
with the Bureau of Agricultural Economics and the Farm Security 
Administration of the Department of Agriculture. State tabulations of 
these data have been furnished to these agencies for administrative use 
in connection with problems relating to farm labor and marginal farms. 
Preliminary reports, or press releases, are also being prepared showing 
figures for those items by geographic regions for which the expected 
error is small enough to justify publication. Present plans are to publish 
the results of all three of these series of tabulations in a special mono- 
graph. 


SOME COMPARISONS ON COSTS OF SAMPLE TABULATIONS 
AND COMPLETE TABULATIONS 


The cost of pulling and coding the 2 per cent sample and making one 
tabulation run by states has been estimated, for this work, at roughly 
one-sixth to one-eighth of the cost of making a single tabulation run 
by counties for all farms. In both cases, the cost of reproducing the 
codes from one card to another has been included; and the cost for the 
sample tabulation includes the complete tabulation for the farms with 
$10,000 and over total value of farm products. The cost of coding and 
tabulating these large income farms exceeds the cost of pulling, coding, 
and tabulating the 2 per cent sample cards for farms with less than 
$10,000 total value of farm products. The cost of making an additional 
run of the sample cards on a state basis, after the sample cards have 
been drawn and coded, is estimated in the neighborhood of 1 per cent 
of the cost of making a single tabulation run by counties for all farms. 
All of these comparisons relate only to costs for machine tabulation and 
do not include costs of preparing the data for publication. The technical 
and clerical costs of processing these sample data have proved to be 
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materially higher, on a per unit basis, than the corresponding costs for 
mass processing of complete tabulations. Additional experience will 
probably result in lowered clerical costs on this phase of the work; but, 
if the Bureau of the Census is to render extensive service in this type 
of work, an adequate technical staff should be provided. 


METHODS OF EXPANDING THE SAMPLE 


For tabulations (1) and (2) previously described, two factors were 
available for expanding or inflating the sample. Published figures were 
available for the total number of farms and total value of farm products 
for each value-of-farm-products group? and also for each major-source- 
of-income group. For the tabulations outlined under (3), only one 
expansion factor was available since the only known values were the 
total numbers of farms in each color-tenure group. For the tabulations 
under (1), it was decided to use the total number of farms in each value 
group for expanding the sample data; that is, the sample figures for 
each value-of-farm-products group were multiplied by the reciprocal 
of the sampling ratio for that value group. 

For nearly all of the items sampled, the totals for all value-of-farm- 
products groups were also known.’ On such items, the expanded sample 
figures for each value-of-farm-products group were scaled to equal these 
previously published totals by simple proportionate adjustments. In 
some of the tabulations, cross classification tables were involved, such 
as numbers of farms cross classified by total value of farm products and 
by size of farm. In such cases, where both the column and line totals 
were known, the expanded sample figures for all cells were adjusted to 
equal these known totals by the process of iterative proportions. That 
is, a succession of horizontal and vertical proportionate adjustments 
were made until the frequency table became rigid. These adjustments 
were made for the convenience of any user who might wish to relate the 
figures to previously published Census data. 

It was recognized that for some of the tabulations under (1), par- 
ticularly such items as all land in farms, value of land and buildings, 
and expenditures for specified items, some increase in precision might 
have been obtained had the sample data been expanded by using an 
inflation factor based on the total value of farm products for each value- 
of-products group. The decision to use total number of farms in ex- 


? For two value-of-farm-products groups the unclassified farms and farms with no farm products 
sold, traded, or used by farm households, only the total number of farms was available. 

3 The same is true for item totals for all major-source groups and for all color-tenure groups. The 
discussion in the remainder of this paper relates only to the tabulations outlined previously under (1), 
since only the figures for these tabulations were available at the time this paper was written. 
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panding the sample was determined partly by administrative reasons 
and partly by the fact that, since the tabulation sort was by value of 
farm products, much of the over-all relationship between value of farm 
products and such items as value of land and buildings and farm ex- 
penditures had already been taken into account. 

A third expansion factor, which could have been used, was the re- 
ciprocal of the sampling ratio based on number of farms for the total 
sample, that is, for all value groups combined. Obviously this would 
have given less precision than that obtained by the use of sampling 
ratios for the individual value groups. This would be particularly true 
for the terminal value groups where the population numbers of farms 
are relatively small and the sampling ratios quite variable. 


SAMPLING ERRORS FOR SOME SPECIFIED ITEMS 


The first tabulations completed under (1) involved principally pro- 
portions of the total number of farms in each value group reporting 
specified characteristics. The exceptions were such items as acreage of 
all land in farms, acreage of cropland harvested, value of land and 
buildings, age of operator, and number of days of work off farm. 

It is well known that when the number of farms in the sample is 
large, the relative sampling error (coefficient of variation) will be small. 
Thus, in the central value groups, for which the sample number of 
farms is large, it is to be expected that the relative sampling errors will 
be considerably smaller than those for the ter:ninal value groups in 
which few farms are represented. Hence, for these terminal value 
groups, the figures presented for some of the geographic regions may be 
subject to considerable error, where based on sample data rather than 
complete tabulations. For example, there is some evidence that, in 
order to obtain a 10 per cent relative error on cropland harvested for 
the “$20,000 and over” value-of-products group for the Pacific Division, 
it would be necessary to increase the sample size from 2 per cent to 
about 5 per cent; while for all land in farms, the same relative accuracy 
could be obtained only with a sample of approximately 15 per cent. 

The relative errors to be expected differ as between items. For ex- 
ample, in estimating age of operator, the size of sample necessary for a 
given relative error is considerably less than that required for all land 
in farms, cropland harvested, value of land and buildings, and work 
off farm. Some evidence is available from California and Illinois: 

For the “Unclassified farms” and for the “Farms with no products’ 
in Illinois, it was found that, to secure the same relative error on all 
land in farms and value of land and buildings as for age, it would be 


9 








84 AMERICAN STATISTICAL ASSOCIATION: 


necessary to obtain samples 6 to 8 times as large. The same was true of 
value of land and buildings and work off farm for California farms in 
the “Farms with no products” classification. On the other hand, for the 
“Farms with no products” group, it would be necessary to increase the 
size of sample on cropland harvested nearly 50 times in Illinois and 
about 90 times in California to obtain the same relative accuracy as for 
average age of operator. The farms in this no-income classification in- 
clude not only small part-time farms near villages and cities, but also 
some farms with large acreages but no income because of crop failure 
or other unusual conditions, which accounts for the large variability. 

For the “$10,000 to $19,999” value-of-products group in Illinois and 
the “$10,000 to $19,999” and “$20,000 and over” value groups in 
California the sample size for all land in farms would need to be from 
40 to 80 times as large as that for age of operator to secure the same 
relative accuracy. For cropland harvested and value of land and build- 
ings for these value groups the indicated increase in sample size to 
secure the same precision as for average age ranges from fivefold to 
twentyfold. The farms with large total acreages in the higher value 
groups include many with a high proportion of the total area made .up 
of pasture land, woodland, and rough land, rather than cropland. Such 
farms have a relatively low value per acre. On the other hand the small 
size farms in these upper value groups are hatcheries, greenhouses, 
nurseries, and other highly specialized operations for which the average 
value of land and buildings per acre is comparatively high. The problem 
of the extreme variability in size of farm, particularly in the western 
states, was partly met by making complete tabulations of the acreage 
of all land in farms, cropland harvested, and value of land and buildings 
for the very large size farms, generally those of 30,000 acres and over. 

For work off farm in the highest value-of-products groups the sample 
data for California indicate that it would require a sample approxi- 
mately 250 times as large as that for average age of operator to secure 
the same relative precision. In these value groups the frequency for 
number of days operator worked off farm is distinctly bimodal. Most 
operators of large-income farms report very little or no work off farm, 
but a few such operators report nearly full-time nonfarm employment, 
generally at professional or business occupations. To secure statistics 
on work off farm for these value groups with a relative error of 10 per 
cent would require approximately a 33 per cent sample, which figure 
includes a correction for sampling from a finite population. 

The table shows the coefficients of variation by value-of-farm-prod- 
ucts groups for three of the farm characteristic items for the East South 
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TABLE I 


PER CENT COEFFICIENTS OF VARIATION FOR PROPORTION OF TOTAL FARMS IN 
EACH VALUE-OF-FARM-PRODUCTS GROUP REPORTING SPECIFIED 
CHARACTERISTICS, FOR THE EAST SOUTH CENTRAL AND 
PACIFIC DIVISIONS 























CV for proportion of total farms in 
each value group— 
Division and value group — ; Reporting 
access Operated by | Reporting horses and/or 
full owners no work off mules and no 
farm tractors 
Per cent Per cent Per cent 
EAST SOUTH CENTRAL DIVISION 
Unclassified farms................ 3,888 18.6 13.8 13.3 
Farms with no products, ete.t...... 10,684 7.4 7.9 13.1 
RE ree 62,979 3.0 2.8 4.1 
ERE ree rer 4% 218,150 7 1.3 1.4 
Ne tae cae Abed kale 222,481 1.8 2.3 1.0 
i a ie i ne i ie aah 207 ,353 1.9 1.0 0.9 
conde dncaeiaws dateu ne 93 ,290 2.7 1.4 1.3 
ees oc ao eee ee ke ewee 84,093 2.7 1.4 1.4 
er re 64,299 2.8 Ben 1.6 
Fe 2 ere 22,778 4.0 2.8 2.9 
DP 6 vscaccscanceane 10,720 5.4 4.2 4.6 
ee ee 12,701 5.8 3.4 4.7 
RN ere oe 5,175 8.1 5.5 10.3 
SS ere 2,963 12.3 7.4 18.7 
$10,000 to $19, 999f.............. 1,310 21.0 10.1 46.4 
rer Teer 485 105.6 23.2 70.2 
PACIFIC DIVISION 

Unclassified farms................ 1,501 16.6 19.4 32.0 
Farms with no products, etc.T...... 8,737 5.1 9.9 18.7 
EPEC 19,575 2.5 6.3 11.4 
No cas acaasanaey 32,449 1.9 5.3 7.4 
ad i as ou neg een ee at 27 ,578 32 5.4 6.9 
En icccwt a chamanie Te 26,492 2.3 5.2 6.5 
le aig oe ae in eet 14,900 3.2 5.8 7.4 
I i ats ica she tas 19 ,637 3.1 4.6 6.1 
$1,000 to $1,499....... axe aeiecdc 28,491 2.5 3.1 4.9 
FO eae ee 19,575 3.8 3.7 6.3 
$2,000 to $2,499..... jw ad 14,259 4.5 4.0 7.5 
$2,500 to $3,999.............005. 24,899 3.7 2.8 6.8 
8 a ee 14,726 6.0 3.1 10.1 
$6,000 to $9,999................. 11,521 8.3 2.9 16.4 
$10,000 to $19, 999....... ns 7,654 10.2 2.8 17.8 
$20,000 and overt............... 4,179 16.0 6.0 25.9 











* The sample size approximates 2 per cent of the numbers shown in this column. 

t Farms with no farm products sold, traded, or used by farm households. 

t For these items, sample data only were available for farms with $10,000 and over total value of 
products. 


Central and Pacific Geographic Divisions. The East South Central 
Division, which is next to the largest geographic division from the 
standpoint of number of farms, includes Kentucky, Tennessee, Ala- 
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bama, and Mississippi. The Pacific Division, which is next to the small- 
est geographic division in number of farms, includes California, 
Oregon, and Washington. In the East South Central Division the farms 
are concentrated in the smaller-income groups; in the Pacific Division 
the reverse is true. For these items, as well as for part of the other 
tabulations listed under (1), the sample cards for the farms of $10,000 
and over were used instead of the complete tabulations. This procedure 
was necessary because the complete tabulations for the large-income 
farms could_not be completed in time for use in these first studies. 

This table illustrates the difficulty of sampling characteristics of 
farms in the terminal value groups where the numbers of farms are 
small. In the East South Central Division the sample indicates that 
full owners comprise only a very small proportion of the operators of 
farms with total value of farm products of $20,000 and over. To secure 
this statistic with a relative error of 10 per cent would require a sample 
of approximately 20 per cent of the farms in that value group. For the 
proportion of farms reporting no work off farm, a characteristic more 
frequently reported in this value group, it would require a sample of 
about 5 per cent to secure a relative error of 5 per cent, or a sample of 
about 25 per cent to obtain a relative error of 2 per cent. For the same 
value group it would be necessary to obtain a sample of about 30 per 
cent in order to secure a figure on proportion of farms reporting horses 
and/or mules and no tractors with a relative error of 5 per cent; if the 
desired precision were 2 per cent, a sample of 70 per cent of all farms in 
that value group would be needed. These comparisons are based on 
calculations which make allowance for the fact that the sampling is 
from a finite population. However, the formula used is for an unstrati- 
fied sample; therefore, the coefficients of variation shown in the table 
may slightly over-state the error. 

The sampling errors discussed here apply only to the figures for in- 
dividual cells. However, the primary purpose of these tabulations is to 
determine patterns of relationships. The tabulations outlined under (1) 
were made on a state basis, although for most, if not all, items publica- 
tion will be restricted to figures for geographic divisions. The con- 
sistency between states in the patterns of the various relationships 
justifies more confidence in the results, when used for an analysis of re- 
lationships, than might be inferred from the relative errors calculated 
for the individual cells. 








MARKETING AND SAMPLING USES OF POPULATION 
AND HOUSING DATA* 


By A. Ross Ecker Anp E. P. Stauptr 
Bureau of the Census 


HE 1940 ceNsUs provides many types of population and housing 
'T vstatisties useful for the establishment of marketing programs, and 
in particular for the effective operation of sample studies required for 
market analysis. These data have much significance even in a period of 
sharply contracted output and distribution of civilian goods. It is now 
more important than ever that goods be marketed as efficiently as pos- 
sible so that the limited supply can be sold at the lowest cost and at the 
least inconvenience to customers. 

Statistics on the size and characteristics of the population and on the 
characteristics of the nation’s homes are of basic importance to market- 
ing men, since people are the buyers and consumers of the great bulk 
of goods produced for sale. Such data as the age and sex composition of 
the population and the tenure and rental value of homes are widely 
used as market indicators. It may be useful to mention briefly certain 
respects in which some of these uses have been modified during the 
war period, but it is not necessary to describe them here since they have 
been discussed in considerable detail elsewhere. Consequently, the 
greater portion of this paper will be devoted to a description of two 
types of use that have received relatively little attention hitherto but 
may become of much importance: (1) To describe the market with 
which a particular business organization is in contact, and (2) to pro- 
vide area and other controls needed for special sampling studies of 
markets. 

SIZE AND CHARACTERISTICS OF THE POPULATION 


There are at least two ways in which the changes arising from war 
have affected the relative importance of data on the size of the popula- 
tion and of those on its characteristics. First, data on the size of the 
population have increased in relative importance because the inter- 
class differences in buying patterns have been greatly reduced. This 
reduction is a result of the disappearance of many types of durable 
and luxury goods and the rationing and price control of certain other 
commodities. At a time when so large proportions of the population 
have sufficient purchasing power to buy more goods than they can get, 
it is less necessary than usual to formulate marketing policy in terms 


* This paper was prepared for presentation at the 104th Annual Meeting of the American Statistical 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 
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of variations in economic status. The sales of sugar, coffee, and gasoline 
are now being made on either a population or a need basis, and respond 
little, if at all, to changes in the purchasing power of various groups. 
Any further increase of rationing programs will expand the zone within 
which the volume of sales is almost directly proportional to the size of 
the population. 

Second, data on the size of the population have particular significance 
when size is changing rapidly. The tremendous scale of internal migra- 
tion revealed by recent estimates of the civilian population (based 
mainly on sugar registration figures) calls for a certain amount of re- 
vision in distribution programs, in the setting of sales quotas, and in the 
allocation of salesmen. These revisions should be made, however, with 
full regard for past population trends. If further revisions of market 
programs need to be made because of decreases in the amount of avail- 
able goods or in number of personnel obtainable, every effort should be 
made to favor areas where past population trends suggest the likelihood 
of the continuation of expanding markets after the war.'! 


HOUSING DATA AS INDICATORS OF MARKETS FOR 
PARTICULAR KINDS OF GOODS 


Data from the Housing Census can be used to some extent as direct 
measures of the potential markets for certain types of goods. Thus, in- 
formation on specific types of equipment, such as mechanical re- 
frigerators, radios, and central heating plants, indicates the number of 
dwelling units without a particular type of facility and thus the number 
of families in the untapped segment of the market. Information on the 
number of homes with electric lighting indicates the total number of 
famiiies to which electrical equipment might be sold. 

By the use of housing data on facilities in combination with data on 
population characteristics it is possible to determine the area, color, and 
tenure groups that form the largest potential markets for specific kinds 
of equipment. Most of these uses are of purely theoretical value at 
present since products of these types are not being produced and the 
supply still available for sale is negligible, but such marketing uses of 
housing data will become very important in the postwar period. 

One exception to this generalization results from the rationing of fuel 
oil. Information on the number of homes having oil burners has been 
used as an indication of the potential market for various kinds of al- 
ternative heating devices, such as coal burners and fireplace grates, 
as well as for conversion equipment, and for insulating materials and 


1 This subject is discussed in a paper entitled “Population Shifts and Income Changes,” by Dr. 
Philip M. Hauser, Assistant Director, Bureau of the Census, presented on January 15, 1943, at the 
Wartime Marketing Conference of the American Management Association, in Chicago, Illinois. 
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other means of reducing heat loss. Housing Census data on home equip- 
ment may increase in importance if prolongation of the war economy 
requires a marked expansion in repair services. 


HOW CENSUS DATA CAN BE USED TO SHOW CHARACTERISTICS 
OF PARTICULAR GROUPS OF THE POPULATION 


Restrictions on the disclosure of data collected by enumerators make 
it impossible for the Bureau of the Census to furnish any consumer of 
statistics with information relating to a particular person or organiza- 
tion. The Bureau may, however, provide data in the form of totals 
pertaining to groups of people or organizations, provided a number of 
persons or firms are included. On this basis, it is possible to derive 
directly from census schedules compilations of the social and economic 
characteristics of groups of individuals that may be of much impor- 
tance to a business organization—groups such as the subscribers to a 
magazine, the charge-account customers of a store, the patrons of a 
public utility, and similar groups of customers or subscribers for whom 
names and addresses are available for identification. 

Statistics on such groups can be used in various ways in the market- 
ing program of a particular organization. For example, data on the 
characteristics of the subscribers to a magazine may be very useful to 
the advertising department since many advertisers normally concen- 
trate their expenditures upon publications that reach people in particu- 
lar economic or social strata. Data on subscribers might be collected, of 
course, by direct field canvass or by mailed questionnaires, but both 
methods involve a considerable amount of planning and are subject to 
certain limitations. Information already collected on the decennial 
census schedules is much more comprehensive than any that can be 
readily obtained through field canvass, and has the merits of uniformity 
of procedure, completeness of coverage, and full cooperation on the part 
of the respondent. Moreover, data can be obtained in this manner from 
census schedules at a cost far below that necessary for the collection of 
similar information through field interviews. 

The possibilities of this procedure may be illustrated by a brief de- 
scription of several jobs which have been done on a contract basis for 
publishing firms. Two of these undertakings were carried out for the 
Curtis Publishing Company in connection with an analysis of a sample 
of subscribers to either the Saturday Evening Post or the Ladies’ Home 
Journal and of a similar sample for the Country Gentleman. Under the 
procedure used, sample lists of subscribers to these magazines were sent 
to the Bureau of the Census for matching with names and addresses on 
the census records of 1940. Because of population movements since 
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1940 and other reasons, some of the names could, of course, not be 
located on the census schedules. For all those that were located, all 
available information was taken from the population schedule, the 
housing schedule, and (for persons living on farms) from the farm 
schedule. The tabulations of data thus assembled furnish a compre- 
hensive picture of the economic and social characteristics of subscribers 
to the particular magazines, without disclosing any facts pertaining to 
a given individual. 

A similar job has been done for the Meredith Publishing Company, 
the publisher of Successful Farming and Better Homes and Gardens. The 
conduct of this job was in general similar to that for the Curtis Publish- 
ing Company, although there were many differences in detail. The work 
entailed the selection of a sample list of subscribers to Successful Farm- 
ing, the matching of this list against the farm schedules, and the tran- 
scription of selected agriculture, population, and housing data. 

On the basis of such materials, publishing companies can determine 
the extent to which their subscribers are typical of the general popula- 
tion and in what respects their distribution with respect to such items 
as education, income, rent, and housing indicates that the subscribers 
are a specially selected group. Editorial as well as advertising policies 
may advantageously be adjusted to findings with regard to the com- 
position of the subscriber-group. 

The technique of using population and housing census files with their 
records of nearly three biilion facts about people and the homes they 
live in seems to have considerable promise for recurrent use as a means 
of revealing the characteristics of particular groups. It seems likely that 
the reopening of old markets and the development of new ones after 
the war may well lead to a very substantial increase in the opportunities 
for effective use of this technique, provided the results of an up-to-date 
census are then available. The method is most useful shortly after a 
census when people are still at the addresses where they were enumer- 
ated. The amount of population shifting is probably great enough and 
the process of migration is selective enough to restrict seriously the 
usefulness of this approach when the latest census schedules are four or 
five years old. 


USE OF SMALL AREA DATA FOR SAMPLING 


The 1940 Censuses of Population and Housing have been so tabu- 
lated as to provide an unprecedented amount of material for relatively 
small areas. Because of the usefulness of such data for many purposes, 
particular emphasis was placed upon small areas in the housing census. 
Data were published by blocks for 191 cities with a 1930 population of 
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50,000 or more. These data include the number of dwelling units and 
structures, dwelling units by tenure and occupancy, average rent, 
number of units needing major repairs or having no private bath, num- 
ber occupied by nonwhites, number with more than 1} persons per 
room, and year built. As an aid to the use of these block data, the more 
significant items are to be shown on block maps for cities with a popu- 
lation of 100,000 or more. The maps greatly facilitate retail site selec- 
tion, delineation of sales districts, and the setting of sales quotas within 
a given city. They also are basic to any study of the structure of cities 
and the shifts in residential neighborhoods which greatly affect sales 
outlets. This is the first time that detailed information of this sort has 
been tabulated on a nation-wide basis for such small segments of cities. 

Addititional data are available for census tracts in 60 large cities 
(including adjacent areas in some cases) which have been subdivided 
into tracted areas. Although census tracts are considerably larger than 
blocks, they are still relatively small areas, usually between 3,000 and 
6,000 in population. The population items available by census tracts 
include sex, age, race, nativity, citizenship, country of birth, highest 
grade of school completed, employment status, class of worker, oc- 
cupation, and industry. Data on housing for census tracts include oc- 
cupancy status, tenure, value or monthly rent, type of structure, state 
of repair and plumbing, size of household, persons per room, radio, re- 
frigeration equipment, and heating fuel by type of equipment. 

Finally, to supplement the block data, a special summary card has 
been prepared for each of the 110,000 enumeration districts outside of 
cities of 50,000 or more. The data available on these summary cards 
include total population and population density, and totals, ratios, or 
averages pertaining to color, sex, farm residence, dwelling units, home 
ownership, rent, farm population, farm acreage, crop acreage, and farm 
value. Since the average population of an enumeration district is about 
900, and most of them are between 300 and 3,000, the enumeration 
district data provide information on characteristics of segments of the 
population that are highly suitable for sample use. The areas are com- 
pact geographical units, so that samples selected within such areas can 
be readily visited by a single enumerator or can be easily located on 
census schedules. Moreover, the enumeration districts are small enough 
to be homogeneous with respect to most characteristics. Hence, the 
variance within the area is likely to be small enough to simplify the 
drawing of a sample from the area. 

By means of the statistics available for small areas, it is possible to 
work out a sample for large areas which is well stratified with respect 
to the characteristics that have proved most useful in marketing sur- 
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veys. Some work based on census tracts has already been done along 
this line by one of the national organizations interested in marketing.’ 


SPECIAL ASSISTANCE AVAILABLE FROM THE CENSUS STAFF 


There are at least three methods whereby these small-area data may 
be useful to marketing organizations and other organizations interested 
in market surveys. First, they make possible a substantial advance in 
the application of sampling theory to various types of problems. Work 
which has already been done by the Bureau of the Census, some of 
which is being described elsewhere in this JouRNAL’ and some of which 
will be presented later, provides the basis for conclusions that are of 
much value to individuals and research organizations concerned with 
sampling. 

In the second place, the Bureau can render important assistance to 
marketing and other organizations by reviewing the problems involved 
in a particular survey and making suggestions concerning the best 
sampling technique as indicated by census experience and statistics. 
If the desired degree of precision of results can be stated in advance, it 
is possible to determine the necessary size of sample for the purpose 
and to estimate the approximate cost of the field or office work. 

The Bureau can also be of assistance in connection with the evalua- 
tion of completed sample surveys. Its experience with data on the basis 
of blocks and enumeration districts enables it to review the adequacy of 
the particular sample design which has been used and suggest how the 
results may be made most reliable. Analysis of the sample in terms of 
blocks or other small areas may be useful in revealing what can be in- 
ferred from a particular sample design, whether it was efficient, whether 
it tended to overweight areas of a special type, etc. For certain classes 
of data, the representativeness of the sample can be determined by 
testing it against the complete returns for 1940 and observing how 
closely the sample approximates the universe. 

Finally, it cannot be too strongly urged that persons and agencies 
realizing the many advantages of sampling procedures for marketing 
studies take the opportunity of consulting the Bureau of the Census 
at an early stage in their work. Service can be rendered most promptly 
and effectively when plans for a survey are being formulated. The 
Bureau is hopeful that its experience and facilities may be utilized to 
an increasing extent by persons and organizations responsible for 
setting up and operating field surveys. 

2 For a description of the work, see Business Week, May 2, 1942, pp. 48-49. 


? Benjamin J. Tepping, William N. Hurwitz, and W. Edwards Deming, “On the Efficiency of 
Deep Stratification in Block Sampling,” this Journau, March, 1943, pp. 93-100. 








ON THE EFFICIENCY OF DEEP STRATIFICATION 
IN BLOCK SAMPLING* 


By BEenJaMIN J. Teppine, Witu1amM N. Hurwitz, 
AND W. Epwarps Deminef 
Bureau of the Census 


OUSING STATISTICS from the 1940 Census are available in publica- 
H tions of the Bureau of the Census for every occupied block in cities 
of 50,000 or more inhabitants, and considerable interest has been no- 
ticed in the use of these statistics for designing sample surveys. In par- 
ticular, there has been interest in the use of sample designs in which 
restrictions are introduced comparable to those used in experimental 
designs of the latin square type.' Such sample designs will be referred 
to below as utilizing deep stratification, and these are to be contrasted 
with standard stratified designs. The difference between deep and sim- 
ple stratification is illustrated in the following paragraphs. 


(a) Example of stratification with complete sampling—In selecting a sample 
of 16 blocks from Wilmington, all of the blocks in Wilmington might be 
divided into 4 geographic groups, and each of these groups further sub- 
divided into 4 classes on the basis of the average monthly rent for the block. 
One block could then be selected at random from each of these 4 rent classes. 
This is an example of a standard design—complete sampling, each of the 16 
strata being represented in the sample. 

(b) Example of deep stratification with incomplete sampling—lIt is possible 
to go further, and separate |the blocks of leach of the above’4 rent classes 
into 4 groups on the basis of the number of dwelling units in the block, and 
then to separate the blocks of each of these groups into 4 groups based on 
the average number of persons per room. The result is a table with 256 cells 
(44X44). It is possible to select a sample of only 16 blocks from these 
256 in such manner that in the sample there is representation from each 
geographic location, each rent group, each group based on the number of 
dwelling units, and each group based on the number of persons per room. 
This design is balanced with respect to each criterion of stratification con- 
sidered separately, but the sampling is incomplete in that only 16 out of 
256 cells within the table are represented. 


The purpose of this paper is to present the results of part of a series 
of investigations that are being carried out on the experimental and 


* This paper was prepared for presentation at the 104th Annual! Meeting of the American Statistica 
Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 

t The authors acknowledge with pleasure and gratitude many valuable suggestions from Mr. 
Morris H. Hansen of the Bureau of the Census, not only in the planning of the experiments, but also 
in the preparation of the manuscript. 

1 Adaptation of experimental designs of the latin square and graeco-latin square types to sampling 
with “deep stratification” was originated some years ago by Lester R. Frankel and J. Stevens Stock, and 
was reported in a paper “On the Sample Survey of Unemployment,” this Journat, Vol. 37, March 
1942, pp. 77-80. 
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theoretical evaluation of the sampling errors and biases of several sam- 
ple designs involving deep stratification, and to compare them with 
some sample designs commonly used. The experiments were con- 
structed so as to make possible an examination of changes in the effi- 
ciency of a design that result from adding controls beyond the point 
where each stratum can be represented in the sample, but retaining the 
same size of sample. The restricted designs investigated are some that 
have been proposed for possible use. 


SOME DEFINITIONS 


A single sample, for our purpose here, will consist of the data ob- 
tained for some one population characteristic, from a set of blocks 
drawn from a city according to some rule. A sample design is character- 
ized by the rule for drawing the blocks, and the procedure for calculat- 
ing the sample results (the estimating technique). Any number com- 
puted from the characteristics of the people or dwelling units in the 
sample is a sample result, an example being the average rent paid per 
dwelling unit, calculated from all the dwelling units in the sample. 
Successive samples, i.e., successive sets of blocks drawn by successive 
applications of the rule, will generate a population of results. The bias of 
a particular sample design, for a certain population characteristic, is 
the difference between the average estimate obtained from successive 
samplings, and the value of that characteristic that would be obtained 
by extending the survey to the entire city. A bias will arise if 

(a) the method of selecting the sample is such that the blocks have differ- 
ent probabilities? of being drawn into the sample, and if 

(b) the estimating technique fails to correct for these differences in the 
probabilities. 


The appearance of the successive results for a biased design® will be 
something like those shown in the chart (q.v.), if the sampling is ran- 
dom. 

The standard error of a sample estimate is a measure of the average 
discrepancy between the sample estimate in a large number of succes- 
sive samples, and the average of the sample estimates. It can be esti- 


2 The “probability that a block will be included in the sample” is here the ratio of the number of 
possible samples that contain the block to the total number of possible samples, when the combinations 
of blocks are made up according to a given sample design. With sufficient care and experience in carrying 
out an actual sample survey, it is possible not only to assimilate the theoretical calculations, but to be 
sure of this in advance. 

? These types of biases are ordinarily much larger than another kind of bias that will be ignored 
here, specifically, a small bias that arises in estimating a ratio (e.g., average rent), when the estimates of 
both the numerator and the denominator are unbiased. The bias thus arising in a ratio is often introduced 
deliberately when it is known to be small, in order to gain the advantage of a smaller variance and the 
consequent greater reliability of the sample results. 
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mated from actual repeated samplings in an experiment, or calculated 
from the formula for the standard error arrived at through theoretical 
considerations of the sampling design, if such a formula has ever been 
derived. These methods can be used interchangeably if the sampling 
demonstrably conforms closely enough with the theory that gave rise 
to the theoretical standard error. 


RESULTS OF REPEATED SAMPLING ACCORDING TO PLAN III 
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SAMPLE NUMBER 


Norte: Each point shows the average monthly rent per dwelling unit, estimated from a sample of 
22 blocks drawn according to Plan III. Observe the wide variation in these estimates, and the fact that 
their average is $1.70 less than the average monthly rent for the city. 


SOME EXPERIMENTS WITH BLOCK SAMPLES 
FROM WILMINGTON 


Each of the samples discussed in this paper consists of 22 blocks, 
which are approximately 2 per cent of all the blocks in Wilmington. 
While for most purposes this sample size is too small to give useful 
estimates of the characteristics of the city, it should be sufficient to 
distinguish between the efficiencies of the various sample designs under 
investigation. The standard error of a sample design is, of course, a 
function of the size of the sample. For the standard designs, the stand- 
ard error of a sample estimate will vary exactly or approximately as the 
reciprocal of the square root of the size of sample; for the deep stratifi- 
cation designs this simple relationship does not necessarily hold. The 
reliability of the estimates for any of the sample designs could be im- 
proved simply by making the sample larger. Also, much better esti- 
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mates of the population characteristics could be obtained with the 
same number of dwelling units by subsampling dwelling units within 
the blocks; i.e., by selecting a larger number of blocks but enumerating 
only some of the dwelling units in the blocks selected. It was not feasi- 
ble to include subsampling in the present set of experiments. However, 
both of these improvements (larger sample and subsampling) can be 
applied equally well to the standard designs (A, B, C, D) and the deep 
stratification designs (I, II, III, IV). Comparisons of the biases and 
standard errors among all these designs are therefore useful in spite of 
the fact that in practice any of them might be modified. 


DESCRIPTIONS OF THE DESIGNS INVESTIGATED 


The sample designs that this paper deals with are described in the 
following outline. The four standard designs used here for comparison 
are denoted by A, B, C, D, and the designs involving deep stratification 
are denoted by roman numerals. 


PLAN STRATIFICATION SAMPLING 


A None 22 blocks drawn at random, 
without replacement 


B Geographic only. The 1100 blocks of Wil- One block drawn at random 
mington are divided into 22 geographic from each group 
groups, each containing the same number of 
blocks 
C* Geographic location and average monthly One block drawn at random 
rent per dwelling unit. The blocks of Wil- from each _location-rent 
mington are divided into 5 geographic class 
groups, each with the same number of 
blocks. Within each of these groups they are 
divided into 4 rent classes, in such fashion 
that each class contains the same number of 
dwelling units, or as closely so as possible. 
The number of dwelling units in each of the 
4 rent classes of one geographic group is not 
necessarily equal to the number of dwelling 
units in each of the 4 rent classes of some 
other geographic group 
D‘ Geographic location and number of dwelling One block drawn at random 
units per block. The blocks are divided into from each location-size class 
5 geographic groups each with the same 
number of blocks, as in Plan C. Within each 
such group, they are divided into 4 size 


4 Note that Plans C and D, by design, provide for 20 strata, and hence for the inclusion of only 20 
blocks in the sample. In comparing these plans with the other designs, their sampling errors will be 
adjusted to an equivalent sample sise of 22 blocks. 
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II 


III 


IV 


classes, in such fashion that each class con- 
tains the same number of dwelling units, or 
as closely so as possible. The number of 
dwelling units in each of the 4 rent classes of 
one geographic group is not necessarily 
equal to the number of dwelling units in 
each of the 4 rent classes of some other geo- 
graphic group 

The blocks of the city are divided into 22 
geographic groups, each containing the same 
number of blocks. The blocks of a geo- 
graphic group are then divided into 4 size 
classes,5 each with the same number of 
blocks. Within each size class, the blocks are 
divided into 4 rent classes, each with the 
same number of blocks. Thus, each geo- 
graphic group is broken into 16 cells (4X4), 
and every cell contains the same number of 
blocks 

Same as Plan I, except that the rent classes 
are determined by dividing the blocks in an 
entire geographic group into 4 rent classes, 
each with the same number of blocks. The 
16 size-rent classes no longer necessarily con- 
tain the same number of blocks 

Same as Plan II, except that the quartiles 
for rent and size are determined by the num- 
ber of dwelling uniis in the quartile, as in 
Plans C and D, rather than by the number 
of blocks. The 16 size-rent classes do not 
necessarily contain equal numbers of dwell- 
ing units 

Same as Plan III 


One block drawn from each 
geographic group, with the 
restriction that in any set of 
4 consecutive geographic 
groups, no two blocks are to 
be drawn from the same 
rent quartile or from the 
same size quartile 


Same as Plan I 


Same as Plans I and II 


One block from each geo- 
graphic group, with the re- 
striction that in any set of 
16 consecutive geographic 
groups, no two blocks are 
to be drawn from the same 
size-rent class 


In the case of all the designs except Plan II, the estimate of the pro- 





portion of dwelling units with a given characteristic is made by divid- 
ing the number of dwelling units with the characteristic in the sample 
by the total number of dwelling units in the sample. For Plan II, the 
estimates are made by weighting the number of dwelling units in each 
sample block by the number of blocks in the stratum from which the 
sample block was selected. 


5 The size of a block is the number of dwelling units that it contains. 
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RESULTS AND REMARKS 


The table shows the results that were obtained from the four designs 
involving deep stratification, and the results that would be obtained, 
on the average, from the four “standard” designs. For Plans A, B, C, 
and D, the standard errors and biases were calculated from appropriate 
approximate formulas, while for Plans I, II, III, and IV, they were esti- 
mated from repeated sampling. Plan A is unbiased, like Plans I and B, 
because the sampling was carried out so that the probability of being 
drawn into the sample is the same for every block.” Plan II is also un- 
biased because, although the probabilities there are different, the sam- 
ple results were computed in a manner that takes these differences into 
account and adjusts for them (vide supra). Plans C and D, and III and 
IV, on the other hand, are biased, the bias being almost as great as the 
sampling error itself for some of the plans and for some of the character- 
istics estimated. 

It sometimes happens that both the bias and the variance of a biased 
estimating technique are so small that the biased technique is prefera- 
ble to an unbiased technique having a larger variance. An example was 
mentioned in footnote 3. In comparing the utilities of various designs, 
the losses and gains of both biases and variances should always be 
taken into account. The purpose of presenting these results for Wil- 
mington is to furnish some evidence to assist decisions with regard to 
comparisons between biases and variances. 

The table shows that every design except Plan II yields a gain in 
information over unrestricted random sampling (Plan A). With the 
standard stratification designs B, C, and D, it would be most unusual, 
although possible in sampling from finite populations, to sustain losses 
over random sampling, and the standard designs actually used did lead 
to gains over Plan A as expected. 

Plan I is the same as Plan B except for the additional controls in the 
former furnished by size and rent. A comparison of the results indicates 
that the additional controls used in Plan I may produce a gain in infor- 
mation for some characteristics, but a loss of information for other 
characteristics. 

Plan III is a restricted design that appears to give a better allocation 
of blocks than Plan I does to the strata in which there may be a greater 
variation of the characteristics in question. While it is not clear why 
this design should be more efficient than Plan I, repeated samplings of 
Wilmington indicated that for the proportions owner-occupied, and 
needing major repairs or with no private bath, Plan III gives estimates 
that are closer to the values for the city than Plan I gives, since in spite 
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of the fact that the sampling bias in Plan III is of a magnitude com- 
parable with the sampling error, the variance is small. This gain is suf- 
ficiently striking in Wilmington to justify the research that is now being 
conducted to discover how the efficiency of this design is affected by 
such things as the concentration or dispersion of a population charac- 
teristic (e.g., vacancy) over the area of the city being sampled. 

Plan IV involves a simple modification of the sampling restriction 
used in Plan III. This modification has the effect of requiring not only 
that each rent quartile and each size quartile shall be represented in 
the sample, but that every possible combination of rent and size shall 
be represented. It appeared, however, that this design is less efficient 
than Plan III, and that there is even a loss compared with Plan B. 

A more efficient design® than those described is one in which the 
blocks are stratified by size in such manner that each of the cells con- 
tains the same number of dwelling units. The sampling is to be carried 
out so that the probability? that a block in a given cell be included in 
the sample is in proportion to the size of the block,® but a constant 
number of dwelling units is to be drawn into the sample from each 
block, regardless of size. This design is unbiased, and has a smaller 
variance than any of the four Plans C, D, III, or IV. This scheme of 
sampling applied to the cells of Plans C, D, III, and IV, even as they 
are, would reduce their biases and variances, in spite of the fact that 
in these plans the cells do not necessarily contain the same number of 
dwelling units. This reduction in bias should arise because the ex- 
pected values of the estimates are essentially the averages of the block 
averages, while for the sampling as proposed by Hansen the expected 
values of the estimates are the averages of the cell averages. If the 
number of dwelling units per block were the same for all blocks, the 
average of the block averages would be equal to the average of the 
cell averages, and both would be equal to the average for the city. 
However, when the number of dwelling units per block varies from 
block to block, the average of the cell averages is closer than the aver- 
age of the block averages to the average for the whole city, except in 
rare circumstances. Experimental work with this design has not thus 
far been feasible. 

While most of the deep stratification designs were substantially bet- 
ter than unrestricted random sampling in the Wilmington experiment, 
the fact that Plan II led to larger errors indicates that in some circum- 


stances, deep stratification will lead to a substantial loss over random 
* The principle of reducing the variance and the bias by making the probability that a block be in- 


cluded in the sample proportionate to its size, and then subsampling a fixed number of dwelling units 
from each block, is an innovation due to Morris H. Hansen. 





STANDARD ERRORS AND BIASES, FOR SELECTED CHARACTERISTICS 
The mean square errors for Plans I, II, III, and IV are estimated from 





AMERICAN STATISTICAL ASSOCIATION: 





the repeated sampling, whereas the mean square errors for 
Plans A, B, C, and D are calculated 


















































Characteristic 
Sample design . Proportion needing Proportion with 1.51 
Proportion major repairs or with | or more persons 
owner-occupied no private bath jf per room 
— 

Proportion for the en- 

tire city .383 . 258 -031 
Plan A (unbiased) 

Standard error .055 .066 .012 
Plan B (unbiased) 

Standard error .046 .057 .O11 
Plan C | 

Bias — .008 .001 — .000 

Standard error .046 .052 -010 
Plan D 

Bias — .027 .000 .002 

Standard error .041 .055 -O11 
Plan I (unbiased) 

Standard error .048 .055 .008 
Plan II (unbiased) 

Standard error .062 .070 .008 
Plan III 

Bias — .027 .021 .005 

Standard error .031 .039 .010 
Plan IV 

Bias — .027 .003 .005 

Standard error .040 .048 .010 











sampling. Deep sampling is thus not something to be introduced in- 
discriminately, though under some circumstances, deep stratification 
may indeed have greater efficiency than the standard designs. Of 
course, the experiments described in this paper are limited to block 
sampling in Wilmington and are not conclusive; further research on 
the subject is needed. Theoretical investigations of sampling errors and 
experimental work to determine the contributions to sampling error 
that may arise in areas of various types and for various characteristics 
are now being carried on in the Bureau of the Census. 

















THE INTEREST RATE AND SAVING* 


By Donatp 8. Tucker 
Massachusetts Institute of Technology 


interest so that an increase of the interest rate would cause a rela- 
tively large increase in the volume of saving. The year 1932 gave an 
exceptional opportunity for testing the accuracy of this view of the 
interest rate as an inducement to save. The premium for liquidity in 
1932 was high; but the long-term interest rate was also high. The pre- 
mium on liquidity caused safe short-term and other highly liquid bonds 
to sell at high prices and at low yields. Less liquid securities sold at 
prices so low that the yield on long-term investments was clearly well 
above normal. Did this call forth an increase of saving? A survey was 
projected. 

Preliminary discussion with various savers made it obvious that an 
inquiry on this subject could not be handled through a questionnaire. 
(1) Diminution of the family income had eliminated saving from many 
budgets. (2) Fear of possible further diminution or even of complete 
cessation of future income was causing many families to struggle fran- 
tically to curtail their current expenditures. (3) The interest rates paid 
by savings banks, the dividends paid by stocks and the coupon rates 
paid by bonds had all been either stationary or declining. It was neces- 
sary in almost every instance to explain that lower prices for fixed- 
income securities meant a higher current yield and a higher rate of 
interest. 

Among about two hundred investors who were questioned during 
this preliminary investigation, only two were found who were both try- 
ing to save a larger percentage of their incomes in 1932 than in 1929 
and thought that they were influenced in this by the interest rate rather 
than by a fear of further diminution of their incomes. In both instances 
the increase in the percentage saved was small. Because both of these 
savers had suffered from the general diminution of current income, 
absolute savings were in both instances smaller than in the previous 
years of prosperity. These two families were, however, trying to re- 
spond to the inducement of a higher interest rate. These were the only 
two out of about two hundred questioned. 

The amazing thing about this investigation, however, was not the 
small number of those affected by the interest rate; it was an even more 


pene has been represented frequently as elastic with respect to 


* This paper was prepared for presentation at the 104th Annual Meeting of the American Statisti- 
cal Association which was to have been held at Cleveland, Ohio, December 29 to 31, 1942. 
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fundamental fact. Economists, statisticians, brokers and investment 
bankers, in other words, those professionally interested in the financial 
markets recognized readily, it must be admitted, that low prices for 
fixed income securities meant higher interest rates; but others, even 
highly successful investors, had not thought of it. All those questioned 
understood this relation readily when it was explained—perhaps be- 
cause this small investigation was limited to a few intelligent and highly 
successful groups. Many of those questioned had been familiar with 
this relation for many years and some even were still receiving financial 
“services” which reported bond yields regularly. In the excitement of 
1932, however, this portion of these reports seems to have been neg- 
lected. In every instance those questioned agreed after some discus- 
sion that current security prices then indicated a high interest rate, a 
high premium for risk and a high premium for liquidity. The high 
interest rate had simply no special significance for them as individuals 
at that time. They had not even bothered to estimate the rate then 
current. 

If in prosperous and successful groups less than ten per cent had rec- 
ognized the existence of high interest rates, the effect of such interest 
rates on the behavior of the general population must be negligible. A 
large scale test of this conclusion was, however, impossible. It was nec- 
essary to distinguish between a shift of the curve and the elasticity of a 
supply curve. Savers would discriminate between them only after 
questioning and discussion. Because employment of a staff was impossi- 
ble, the survey was discontinued. This abortive study failed therefore 
to prove anything. It was, however, extremely suggestive. 

Low interest rates in more recent years have apparently done very 
little to discourage financial saving. That interest rates affect the will- 
ingness to invest is not questioned. The demand for investment funds 
and the supply of capital money available for investment are both af- 
fected apparently by the interest rate; but financial saving is only one 
of the streams that feed the reservoir of capital money (or uninvested 
funds) and current investment is only one of the outlets from that 
reservoir. This survey made no effort to measure the outlets. It made 
no effort to measure real liquidation, the other great inlet to this 
reservoir. Even with respect to financial saving it proved nothing be- 
cause of its small size; but it suggested strongly that the amount of in- 
come saved is not affected perceptibly by short-period changes in the 
rate of interest. In no single instance among those questioned did the 
1931-1932 rise in long-term interest rates result in an actual increase in 
the amount of money saved. 
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NOTE ON DR. BERKSON’S CRITICISM OF 
TESTS OF SIGNIFICANCE 


By R. A. FisHEer 
The Galton Laboratory 


HEN, about eighteen years ago, I was looking for a good example 
W: o illustrate the test of straightness appropriate to regression lines, 
I felt myself particularly lucky to find one in which the departure from 
linearity was of a somewhat unusual kind, and in which a superficial 
examination by graphical methods, without submitting the subjective 
impression to any objective test, was likely to be misleading. The case 
chosen was from a comprehensive paper by A. H. Hersh on the influ- 
ence ‘of temperature on the number of eye facets developed in Droso- 
phila melanogaster in a number of homozygous and heterozygous phases 
of the bar factor. Several of these phases showed to graphical inspection 
a remarkable discontinuity in the rate or direction of change in the 
neighborhood of 24° C. The heterozygote between the wild gene and 
that known as ultra-bar, on the other hand, gave frequencies which, if 
examined uncritically, or by methods capable of detecting only the ob- 
vious, would have failed to indicate that anything remarkable was 
happening at the critical temperature. It was, therefore, of some little 
interest to see whether a new criterion based on the rigorous solution 
of a problem of distribution then but recently cleared up, would enable 
the experimenter with such material to detect features of importance 
in his data, which, without such aid, would have escaped his notice. 

Dr. Joseph Berkson writing, curiously enough, in 1942, when the 
advantages of objective tests are at least more widely appreciated than 
they were, refers to this example in the September number of this 
JourRNAL. He has drawn the graph. He has applied his statistical in- 
sight and his biological experience to its interpretation. He enunciates 
his conclusion that “on inspection it appears as straight a line as one 
can expect to find in biological material.” The fact that an objective 
test had demonstrated that the departure from linearity was most de- 
cidedly significant is, in view of the confidence which Dr. Berkson 
places upon subjective impressions, taken to be evidence that the test 
of significance was misleading, and therefore worthless. 

It is not my purpose to make Dr. Berkson seem ridiculous, nor, of 
course, to prevent him from providing innocent amusement. Had he 
looked up Hersh’s original paper he would have been spared a blunder, 
but we should have lost an example of the dangers of authoritarian 
judgment, based on subjective impressions, which even at the present 
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date may be of value. Evidently, good biological data, examined by 
accurate methods, are capable of being much more informative than 
Dr. Berkson imagines. It is very well worth while to be reminded that 
general condemnations of “biological material” based on limited ex- 
perience, as Dr. Berkson’s judgment must be, may vastly underesti- 
mate the cogency of the evidence which careful and extensive work 
neatly provides. 

A further confusion arises in Dr. Berkson’s final comment on this 
example: “In this case my own judgment would be, not that the regres- 
sion is non-linear, but that the temperature has varied during each or 
some of the experiments. At least that would explain the small P.” As 
experimenters well know, one of the commonest uses for tests of sig- 
nificance is to detect errors of technique, or to confirm the belief that 
the technique has been adequate. Improbable as such an explanation 
seems in the case of Hersh’s data, it is in general true, and important, 
that a discrepancy between observation and expectation may be due in 
fact principally to imperfections in the actual conduct of the work. If, 
for example, it could be supposed that the extensive series of observa- 
tions at temperatures 23° and 25° C. had in error been completely in- 
terchanged, the evidence for non-linearity in the case of this genotype, 
though not of the others, would disappear. The fact that the test of 
significance has shown the series of facet numbers to be non-linearly 
related to the series of temperatures recorded would, if there really were 
reason to expect them to be linear on the true temperatures, have aided 
in the detection of such an error. What Dr. Berkson fails to realize is 
that the judgment, from inspection, that the line appears as straight as 
one ought to expect, would have given no aid whatever towards discov- 
ering the cause of any real anomaly, whatever the cause might be, be- 
cause that judgment in effect denies the evidence that any real anomaly 
exists. 








A. MANUEL FOX, 1889-1942 


BRAHAM MANUEL FOX, for many years a member of the American 
Statistical Association, died at Chungking, China, on June 21, 
1942, at the age of 53. At the time of his death he was engaged in his 
duties as a member of the American-Chinese Stabilization Fund Board. 
Mr. Fox was born in Philadelphia. He studied civil engineering and 
economics at Cornell University, the University of Michigan, and New 
York University. From the latter he received the degree of Master of 
Arts in 1924. Before entering government service, Mr. Fox for several 
years was engaged in expert economic research for the State of Michi- 
gan and for leading private corporations, and in teaching economic sub- 
jects in the College of the City of New York. 

For many years Mr. Fox was active in the work of the United States 
Tariff Commission. He joined the staff in 1923 and soon thereafter be- 
came chief of the Economics Division. In 1934 the position of Director 
of Research was created and he was appointed to fill it. He became a 
member of the Tariff Commission in 1937, resigning in 1941 to join the 
American-Chinese Stabilization Fund Board. 

During his many years as a member of the staff of the Tariff Commis- 
sion Mr. Fox was the recognized leader of its research activity. He was 
largely responsible for the thoroughness and soundness of the wide 
variety of investigations conducted by the Commission. One of his most 
important contributions was the planning and directing of the extensive 
report entitled Economic Analysis of Foreign Trade of the United States 
in Relation to the Tariff, prepared in response to a Senate Resolution and 
published in 1933. 

Both during his incumbency as Director of Research and during his 
membership on the Tariff Commission, Mr. Fox participated exten- 
sively in the government’s trade-agreements program. He directed 
the preparation of a large mass of information furnished to the Con- 
gress in connection with the discussion of the enactment and subse- 
quent extensions of the Trade Agreements Act. He had charge of the 
preparation, by the commodity experts and economists of the Tariff 
Commission, of digests of information concerning every commodity 
on which possible concessions by the United States were to be consid- 
ered by the interdepartmental Trade Agreements Committee. He was 
himself a member of that committee and exercised much influence in its 
decisions. He was also for some time Vice Chairman of the Committee 
on Reciprocity Information, whose function is to receive written and 
oral statements from interested parties concerning possible concessions 
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by the United States in trade agreements, as well as concessions to be 
sought from the foreign countries. For a number of months Mr. Fox 
was the head of an economic mission which visited Venezuela, at the 
request of the government of that country, and made important recom- 
mendations regarding tariff and other economic matters. 

The American-Chinese Stabilization Fund Board consisted of five 
members—an American, an Englishman, and three Chinese. Mr. Fox’s 
activities as a member were even more varied and responsible than the 
name of the board would suggest. In a broad sense he was a general eco- 
nomic adviser to the Chinese Government and a general adviser to the 
United States Government with respect to all economic relations with 
China. For several months not long before his death he was in Wash- 
ington participating in important financial negotiations between the 
two countries. During most of the time he was in China, Mr. Fox was 
either in Hong Kong, from which he and his associates had an exciting 
escape, or in Chungking. 

The following is an extract from a resolution by the United States 
Tariff Commission concerning the death of Mr. Fox: 

Throughout his nearly two decades of service with the Tariff Commission 

Mr. Fox showed unusual energy; remarkable thoroughness, keenness and 

ingenuity in the conduct of research; and sound judgment on economic prob- 

lems and on public policy. In his personal relationships he combined inspiring 
leadership with friendliness and justice. He sought always the public in- 
terest. The qualities which Mr. Fox had manifested in his service with the 

Commission were equally conspicuous in his service in China, which, more- 

over, involved often great personal hardship and danger. By his sudden 

death in the full vigor of his powers, the United States, the Republic of 

China, and the other United Nations have lost a highly efficient and deeply 

patriotic public servant. 

E. Dana DuRAND 

United States Tariff Commission 








DAVIS RICH DEWEY, 1858-1942 


AVIS RICH DEWEY who died in Cambridge on December 13, 1942, 
D was secretary of the American Statistical Association for twenty 
years (1886-1906). He was also throughout that period editor of its 
Publications, though during the latter portion of this period he was 
assisted by a committee. Because these Publications developed later 
into the Journal of the American Statistical Association, he might 
almost be considered the first editor of this JourNau. He was certainly 
the Statistical Association’s first editor. In addition to being present at 
the founding of the American Economic Association in 1885, he became 
its vice-president in 1908 and its president in 1909, he was also an early 
member of its publications committee and became finally the first edi- 
tor of the American Economic Review. He served as managing editor of 
that magazine for thirty years (1911-1940). As editor, as statistician 
and as economist, Professor Dewey was interested always in the accu- 
rate reporting of fact. 

He was born at Burlington, Vermont, in 1858, and graduated from 
the University of Vermont in 1879, taking Phi Beta Kappa honors. He 
taught for one year in a district school at North Underhill, Vermont, 
and in the following year was principal of a country academy at Under- 
hill. In 1881 he was appointed principal and teacher of Latin and Greek 
at the Hyde Park High School, near Chicago. In 1883 he entered the 
graduate department of Johns Hokpins University, secured a fellow- 
ship there, and in 1886 received his doctor’s degree in economics. While 
at Johns Hopkins he connected himself with Bradstreet’s and was a 
regular correspondent of the publication; he spent the summer of 1885 
in editorial work in the offices of Bradstreet’s. 

Coming directly from Johns Hopkins in 1886 to an instructorship in 
history and economics at the Massachusetts Institute of Technology, 
he was made professor of economics and statistics in 1893. He was head 
of the department of economics and statistics until his retirement, and 
was in charge of the course in engineering administration from its be- 
ginning in 1913 until 1931, when a separate department of engineering 
administration was established. 

Always interested in questions of public welfare, Professor Dewey 
served on many state and national commissions. In 1894 he was chair- 
man of the Special Commission in Massachusetts to Investigate the 
Subject of the Unemployed; in 1896 he was a member of the Board to 
Investigate Charitable and Reformatory Institutions of Massachu- 
setts; in 1903 he was a member of the Commission on Relations between 
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Employer and Employee. As a result of these investigations he shared 
in the publication of several reports relating to the respective subjects. 
From 1897 to 1905 he was a member of the Municipal Department of 
Statistics of the City of Boston. In 1901 he was appointed special expert 
agent on wages of the Twelfth United States Census; in this latter ca- 
pacity he conducted the research necessary for the publication in 1903 
of one of the most significant contributions of the Census Bureau to 
wage statistics, a volume entitled Employees and Wages, a collection of 
wage rates covering thirty-four industries. Between 1928 and 1932 he 
served on three emergency boards and on one board of arbitration ap- 
pointed by the President of the United States to investigate railroad 
disputes. These cases involved several railroads of the Middle West and 
some nation-wide labor organizations: the Kansas City, Mexico and 
Orient system; the Western Railways versus the Order of Railway Con- 
ductors and Brotherhood of Railway Trainmen; the Illinois Central 
Railroad, the Yazoo and Misssissippi Valley Railroad and the Gulf and 
Ship Island Railroad; the Louisiana, Arkansas and Texas Railway 
Companies. 

He was connected with educational institutions in various capacities. 
He received an honorary degree of LL.D. from the University of 
Vermont in 1910. He was a trustee of Massachusetts State College 
from 1909 to 1939. At various times he gave courses in economics at 
Johns Hopkins University, Wellesley College, Harvard University, 
Columbia University and the University of California. His interest ex- 
tended also to secondary education; and he became recently a trustee 
of the Browne and Nichols School for Boys in Cambridge. 

Among Professor Dewey’s more important publications are Financial 
History of the United States, in recognition of which he was awarded in 
1904 the John Marshall Prize by Johns Hopkins University and which 
went into its thirteenth edition in 1941; he was editor of the two-volume 
work, Discussions in Economics and Statistics by Francis A. Walker 
(1899); articles in Literature of American History edited by Larned 
(1902); National Problems, 1885-1897 (1907); State Banking before the 
Civil War (1910); joint author with Mr. M. J. Shugrue of Banking and 
Credit (1922); articles for Palgrave’s Dictionary of Political Economy; 
the New International Encyclopaedia Americana; Encyclopaedia Bri- 
tannica; American Year Book, and the Commonwealth History of 
Massachusetts. 

His friends loved to think of him surrounded by his family in hi 
summer home at New London, New Hampshire. The flower gardens, 
which he tended himself, were notable and the view from his farmhouse 
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was picturesque. Occasionally Professor and Mrs. Dewey left this de- 
lightful location for a summer of travel—in 1923 to the Pacific Coast 
and in 1929 to Europe when he attended the annual meeting at Warsaw 
of the Institut International de Statistique; in 1936 to Mexico for an- 
other meeting of the I. I. deS. 

In 1886 Dr. Dewey was married to Mary C. Hopkins of Madison, 
Wisconsin, who survives him. Their children are Mrs. A. Barr Com- 
stock, now living in Washington, and Colonel Bradley Dewey, presi- 
dent of the Dewey and Almy Chemical Company and now deputy rub- 
ber administrator. He is also survived by his brother, Dr. John Dewey, 
by six grandchildren, Dr. Bradley Dewey, Jr., Dr. Davis R. Dewey, II, 
Marguerite Mellen Dewey, Ann Dewey, Ensign A. Barr Comstock, Jr., 
Dorothy Comstock and by a great grandchild, Margot Dewey. 

No man ever remained long in contact with him without being helped 
by his kindness and inspired by the nobility of his character. 

Beatrice A. Rocers and 
Dona.p S. TUCKER 








CHARLES P. NEILL, 1865-1942 


HE PASSING of Dr. Charles P. Neill on October 3, 1942, at his home 
Tis Washington, D. C., after a protracted illness, closed a long and 
eminent career. Among the positions he occupied in a life crowded with 
notable accomplishment were: Professor of political economy at the 
Catholic University of America; Assistant Recorder of the Anthracite 
Coal Strike Commission; United States Commissioner of Labor; Mem- 
ber of the United States Immigration Commission (1907-10); Manager 
of the Bureau of Information of the Southeastern Railways (1915-39); 
Member of the United States Coal Commission (1922-23) ; and Umpire 
for the Anthracite Conciliation Board for over twenty years. 

Dr. Neill, a son of Julia and James Neill, was born in Rock Island, 
Illinois, in 1865, and his early youth was spent in Austin, Texas. He 
attended the University of Texas, received his Bachelor’s degree from 
Georgetown University (1891), his Master’s degree from the Univer- 
sity of Notre Dame, and his Doctor’s degree from Johns Hopkins Uni- 
versity (1897). While still pursuing his studies at the last mentioned 
institution he became instructor at the Catholic University and was its 
professor of political economy when he resigned in 1905 to become 
United States Commissioner of Labor. 

Notre Dame University conferred upon him the Honorary Degree of 
Doctor of Laws in 1908, and in 1922 awarded him the Laetare Medal 
for his outstanding services in controlling child labor in the South and 
for the mediation of labor disputes. 

As Commissioner of Labor under Presidents Theodore Roosevelt and 
Taft, Dr. Neill’s services were regarded as so valuable that he was re- 
appointed at the beginning of President Wilson’s administration. 

During the great part of Dr. Neill’s incumbency, 1905-1913, the Bu- 
reau of Labor was in the Department of Commerce and Labor and 
many important functions now performed by various other Federal 
agencies devolved upon that Bureau which was a pioneer in the promo- 
tion of labor legislation. It may be recalled that workmen’s compensa- 
tion legislation, which is now taken for granted, was then considered as 
a remarkable step forward in social progress in the United States. 

The Bureau was also charged with the investigation of the conditions 
of woman and child labor, and under the direction of Dr. Neill made a 
most comprehensive study of this subject, the results of which were 
published in 19 volumes. 

Under the Erdman Act, which constituted the embryo of later elabo- 
rate railroad legislation for the preservation of industrial peace, the 
Chairman of the Interstate Commerce Commission and the United 
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States Commissioner of Labor were constantly called upon in person 
to use their good offices in railway labor controversies. Judge Martin 
A. Knapp, then chairman of the Interstate Commerce Commission, 
and Dr. Neill were instrumental in averting numerous important 
strikes, some of which would have been colossal in their scope. The pre- 
vention of a threatened national tie-up of telegraph lines was among 
the triumphs for the Bureau of Labor in the field of industrial relations. 

In 1906 President Roosevelt appointed Dr. Neill and James B. Rey- 
nolds to investigate the meat-packing industry in Chicago. The findings 
were embodied in a courageous report which aroused nation-wide inter- 
est. 

While he was Commissioner of Labor, Dr. Neill was also a member of 
the United States Immigration Commission (1907-10), which made a 
study of the immigration problem from innumerable angles and issued 
its findings in 41 volumes. 

Dr. Neill served as Umpire for the Anthracite Conciliation Board 
from 1906 to his resignation in 1929, combining this arduous office with 
various other important activities. The following excerpts from an edi- 
torial in The Scranton Times, of October 5, 1942, headed ““The Anthra- 
cite Law Giver,’’ summarize his work in this field. 

As long as anthracite coal is mined the generations of mine workers .. . 
will be indebted to Dr. Neill. 

The interpretations or decisions made by Dr. Neill still represent the 
body of the law of the anthracite. Except for modifications due to the wider 
introduction of machinery and the development of mine engineering and 
new work, the law of the anthracite industry established by Dr. Neill is the 
rule today as it affecis wages and working conditions directly and human 
relations potently in a great and unique industry. 

Dr. Neill was long a Fellow of the American Statistical Association 
and in 1916 he served as our President. 

For many years Dr. Neill was prominent in the national Capital’s 
civic and welfare organizations, served on the Community Chest, and 
was for eight years Vice President of the District of Columbia Board of 
Charities. 

Dr. Neill was a staunch Catholic and a dominant influence in Na- 
tional Catholic movements in the field of social action. He gave gener- 
ously of his time on national committees. He was deeply interested in 
the promotion of the National Catholic Welfare Conference, especially 
in its formative years, and was the first director of the National Cath- 
olic School of Social Service. After his resignation from that office he 
continued to lecture on economics at that institution. 

A. F. HInrIcHs 


Acting Commissioner of Labor Statistics 








NORMAN JOHN SILBERLING, 1892-1942 


ORMAN J. SILBERLING died on October 18, 1942, at his residence 
N in Palo Alto, California. At the time of his death he was Vice- 
President of the American Statistical Association, covering the field 
of facts and methods bearing upon economics and economic theory. 
The Association has lost a valuable member. 

Dr. Silberling was born in Cleveland, Ohio, in 1892. In his under- 
graduate days at Harvard he developed a keen interest in economics. 
After receiving the Bachelor’s degree, magna cum laude, in 1914, he 
continued his economic studies as a graduate student, receiving the 
Master of Arts degree in 1915 and Doctor of Philosophy in 1919. His 
doctoral dissertation was on the History of Theories of Money and 
Credit from Adam Smith to John Stuart Mill. It pointed the way toward 
his life work. The following year, Dr. Silberling was appointed instruc- 
tor of the Department of Economics at Harvard, and during 1920-1921 
held an assistant professorship at Dartmouth College. 

In the fall of 1921 he went to the University of California as associate 
professor of Economics, receiving his full professorship in 1929. In 1930, 
his status was changed to that of part-time lecturer in order that he 
might give more of his time to the research organization which he had 
established. In 1935 he was appointed Professor of Business Research 
in the Graduate School of Business. He continued in this capacity until 
his death. 

Dr. Silberling was an able and inspiring teacher but his major inter- 
est lay in financial research. In The Silberling Reports, later Graphic 
Survey of Industry and Finance, and in confidential weekly letters, cir- 
culated among his business and investment clients, he made interesting 
statistical analyses of the economic development of the western region 
which should be of real value to economic historians of the interwar 
period. His greatest satisfaction came from his labors in what he called 
the sphere of business dynamics. In this field his book, which fortu- 
nately he had completed with the exception of minor revisions in the 
later chapters, is a major contribution. 
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REPORTS OF THE ASSOCIATION 
Report of the Board of Directors 


As nineteen hundred and forty-two, one of the most momentous years in 
the history of the nation and the world, draws to a close, we again undertake 
to review the activities and accomplishments of our Association and to ap- 
praise the outlook for the immediate future. As in the life of the nation the 
year was one of rapid change and repeated readjustment to new conditions, 
so in the history of the Association it will be recorded as one in which prece- 
dents were superseded and traditions broken. For both patriotic and practi- 
cal reasons, the Annual Meeting which, more than any other single activity, 
symbolizes the Association’s life and progress was cancelled. To the Vice- 
Presidents and other officers and members who participated in the planning 
of the Cleveland meeting, we wish to express hearty appreciation of the 
efforts expended and sincere regret that the sessions as planned could not 
materialize. We hope that a considerable number of the papers intended 
for the Annual Meeting will become available to the members in the columns 
of the JouRNAL. 

The contribution of the Association to the prosecution of the war may be 
measured in terms of the active participation of its officers and members in 
the vital work of the government war agencies, institutions of learning, 
and industrial concerns. Many of our members occupy key posts in the statis- 
tical branches of the government service. Others are serving as consultants 
on a part-time basis. College and university members are carrying heavy 
teaching loads and, in addition, are collaborating on important war research 
projects. Other members are contributing to the progress of the war through 
the development of new techniques, and the application of statistical con- 
trols, in industry. Your officers and Directors have devoted considerable 
attention to the study of ways in which the Association might contribute 
directly to the furtherance of the war program. Under instructions from the 
Board of Directors, the Secretary early in the year interviewed government 
statisticians, business officials and members of university and foundation 
staffs to obtain suggestions as to possible contributions the Association might 
make to the war effort. One of the Directors, serving as a committee of one 
appointed by the Board, has devoted much time to exploration of means by 
which the Association might help to increase the extent and improve the 
quality of applications of statistics to war problems. 

Chapters: The Chapters have experienced the impact of the war in varying 
degree. All have been affected to some extent by the loss of active members 
to the armed forces, by the emigration of others from their home communi- 
ties to accept positions elsewhere and by the preoccupation of the remaining 
members with war work. It is encouraging to note, however, that, despite 
these handicaps, unusually fine programs have been developed by some of 
the Chapters. The Washington Statistical Society and the New York 
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District Chapter have made tentative plans for the presentation of a selec- 
tion of the Annual Meeting papers at Chapter meetings. New Chapters 
established during the year were the Asociacion Cubana de Estadistica 
Chapter in Havana, Cuba, and the Central New Jersey Chapter. The Charter 
of the Lehigh Valley Statistical Society, which had been inactive for several 
years, was revoked. The Detroit District Chapter, also inactive in recent 
years, has resumed regular meetings and it is anticipated that a reorganiza- 
tion of the Madison, Wisconsin, Chapter will be effected early in 1943. With 
the cancellation of the Annual Meeting in 1942 and the prospect that the 
usual type of general meeting will have to be discontinued for the duration 
of the war, the contribution of the Association to its members becomes in- 
creasingly dependent upon the vitality of the Chapters. 

The Biometrics Section: During the year the Biometrics Section held joint 
meetings with the American Society of Biological Chemists, the American 
Association of Cereal Chemists, the Soil Science Society of America and the 
American Society of Agronomy. Additional joint sessions with five other 
biological societies and with the Institute of Mathematical Statistics were 
planned for the Annual Meeting of our Association in Cleveland and the 
meetings of the American Association for the Advancement of Science in 
New York but had to be cancelled. While the organization of additional 
sections in other fields and the better coordination of sections with the parent 
Association are included in the long range aims of the Association, the 
Directors feel that efforts in these directions should await a more propitious 
time than the present. 

Committees: In spite of the handicaps imposed by the national emergency, 
most of the Association’s committees continued to render effective service 
in their respective fields. The Census Advisory Committee held three two- 
day meetings. Consideration was given to the need for expanding the staff 
of the Census Bureau and retaining a nucleus of well-trained professional 
persons for carrying on current work, conducting intercensal projects and 
planning for the decennial censuses. There was also extended discussion of 
questions relating to the establishment of a national population registration 
system and the role of the Bureau in the prosecution of the war. The Com- 
mittee on Census Enumeration Areas was effective in promoting the distri- 
bution of the 1940 census tract publications of the Bureau of the Census 
through extensive correspondence with representatives of the various census 
tract cities. The Joint Committee on Occupational Classification served as 
a clearing house for occupational classification questions and advised various 
agencies undertaking occupational coding. A subcommittee was actively 
engaged throughout the year in checking the convertibility of the titles in 
the Supplement to the United States Employment Service Occupational 
Dictionary to the Convertibility List and in reviewing the allocation of 
Army occupations in the Occupational Dictionary. The Joint Committee on 
Relief Statistics continued to function as an advisory committee in con- 
nection with various aspects of the work of the Social Security Board and 
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individual members of the committee were called upon for advice at various 
times. 

During the year an advisory committee to the Federal Security Ad- 
ministrator in the field of morbidity statistics was appointed to serve under 
the chairmanship of Dr. Lowell J. Reed. The Committee on Hospital Statis- 
tics was discontinued. To meet the needs formerly served by the latter Com- 
mittee, Dr. Halbert L. Dunn and Dr. Edwin L. Crosby were appointed 
consultants from the American Statistical Association to the Committee on 
Accounting and Statistics of the American Hospital Association. 

Members of the Joint Committee for the Development of Statistical Ap- 
plications in Engineering and Manufacturing advised and consulted with 
government and industrial groups on the application of statistical control 
techniques to problems arising in connection with the war. They also co- 
operated in the arrangement of scientific meetings and in the organization 
of statistical effort in engineering and scientific fields. A subcommittee on 
Engineering and Scientific Graphs of the Sectional Committee on Standards 
for Graphic Presentation submitted to the American Standards Association 
its proposed standard “‘Engineering and Scientific Graphs for Publication.”’ 
Through its representatives, the Association also participated in the work 
of the Social Science Research Council, the National Bureau of Economic 
Research, the American Association for the Advancement of Science, the 
Advisory Board of the American Year Book, the Advisory Committee to the 
Census Library Project and the American Documentation Institute. 

Publications: Volume 37 of the Journat of the American Statistical As- 
sociation, published in 1942, was similar in size and content to other recent 
volumes. As in other years, a considerable proportion of the first two num- 
bers in the volume was devoted to Annual Meeting papers. Volume 2 of the 
American Statistical Association Bulletin was completed. For reasons of 
economy, the six issues published in 1942 were much smaller than those of 
the preceding year. 

Membership and Finances: The membership records for 1942 are in striking 
contrast with those for the three preceding years and reflect in the main the 
adverse conditions of the present war period. The number of names removed 
from the rolls by reason of resignation, death, or termination for failure to 
pay dues, totaling 400, was more than one and one-half times as large as the 
total number of new members and reinstatements. This resulted in a net 
decrease of about 200 in the membership of the Association.' Of those who re- 
signed or were delinquent in payment of dues, slightly less than two-thirds 
joined the Association during the last three years. The number of new mem- 
bers enrolled during the year (232) was the same as in the year preceding 
that of the Centenary Celebration. 

Income from subscriptions, as well as from dues, showed a substantial 
decrease in 1942. Nevertheless, as the result of increased income from other 


1 These estimates were superseded by figures compiled at the end of the year. See the Report of the 
Secretary, immediately following. 
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sources, the institution of economies in office and other expense, and the con- 
tinued generosity of the American University in permitting us to retain our 
present quarters, the Association was able to close the year with a slight 
excess of receipts over expenditures. As shown in the financial statement, 
the Association’s income for the year was $19,633.08 while the total expenses 
were $19,271.60. No funds were withdrawn from the Centenary Sustaining 
Fund during the year. The balance in this Fund at the close of 1942 was 
$4,802.97. 

Outlook for the Future: As suggested earlier in this report, your Directors 
believe that the present times are unfavorable to the maintenance of a pro- 
gram of continued expansion such as was contemplated at the time the 
Centenary Sustaining Fund was established. It seems to them the part of 
wisdom, therefore, to adopt a program of retrenchment for the duration of 
the war, curtailing expenses to keep, if possible, within the income available 
from regular sources alone, and husbanding the resources of the Centenary 
Sustaining Fund until such time as conditions offer greater promise of the 
success of undertakings which would necessitate drawing upon that Fund. 
When opportunities for useful service become apparent and the prospects 
for the successful realization of our objectives become brighter than at pres- 
ent, we shall feel justified in again drawing upon the Centenary Sustaining 
Fund to defray the expenses of an enlarged program of activity. 


W. Leonarp Crum LowELL J. REED 

R. L. FUNKHOUSER WINFIELD W. RIEFLER 
F, Lesytre Hayrorp WaLrTrer A. SHEWHART 
ALFRED J. LoTKA FREDERICK F. STEPHAN 


Report of the Secretary 


It is customary for the Secretary to present certain statistics about the 
membership of the Association at the close of each calendar year. In 1942 
the total membership decreased from 3,181 to 3,037, a net decline of 144 
members. During the year, 232 new members were elected and 24 former 
members were reinstated. Two hundred and ten members were removed from 
the active rolls for failure to pay their dues. One hundred and seventy-five 
members resigned and 15 were lost by death, including three Fellows. Of 
the latter, Carroll W. Doten and Charles P. Neill were Past Presidents of the 
Association, and Davis R. Dewey was Secretary of the Association for many 
years and the first Editor of the Publications of the American Statistical Asso- 


ciation. 
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Membership statement, December 81, 1942 





EE eee A Oe I RE ee ae 14 
Corporate members........... 4 
lial acd hiw 4 a4 ie is eae ew 109 
I .. wemee wees eaes inh crasitgaardd 2,910 

pT ee Me Saree 3,037 


One Fellow was a Contributing Member during 1942. One new Life Member 
was elected during the year, bringing the total number of Life Members to 37. 

The death of the following members was recorded during the year: Davis 
R. Dewey, Carroll W. Doten, and Charles P. Neill, Fellows; Horace R. 
Coddington, John R. Dower, Elmer R. Enlow, Emily I. Farnum, A. Manuel 
Fox, William F. French, Constance M. Gay, James L. Gore, Clyde M. 
Leffler, Norman J. Silberling, Walter E. Trent, and Frank W. Wheeler, 


Regular Members. 
R. L. Funkgovuser, Secretary 


Report of the Committee on Fellows 


During 1942 the Committee on Fellows elected ten members of the Asso- 
ciation to Fellowship. This was in accord with earlier decision by the Asso- 
ciation that the number of Fellows should be slowly increased toward a 
maximum of one hundred and fifty. Those elected this year were: 


C. I. Bliss Jerzy Neyman 

W. Edwards Deming Frank W. Notestein 
Edward L. Dodd John W. Scoville 
Edward V. Huntington Dorothy Swaine Thomas 
Robert R. Nathan Rufus W. Tucker 


At the beginning of 1942 there were one hundred and two living Fellows. 


During the year three died. 
Frank A. Ross, Chairman 
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Report of the Treasurer 


In 1942 the income of the Association from regular sources totaled 
$19,633, a decline of nearly $600 from the 1941 figure. Most of the reduction 
was in the sale of the Index to the Journal, which was published early in 1941, 
Among the other sources of revenue, declines in income from membership 
dues and from subscriptions were offset substantially by increases in advertis- 
ing revenue, the sale of reprints, and in JouRNAL sales. Owing to the War, 
a number of foreign libraries and other institutions have had to cancel their 
subscriptions to the JouRNAL, accounting for most of the decline in income 
from this source. 

As noted in the Report of the Board of Directors, there was no transfer in 
1942 from the Centenary Sustaining Fund to the general account of the 
Association, whereas the year before the sum of $2,500 was used to supple- 
ment the regular income. The Centenary Sustaining Fund is being preserved 
pending the return of an appropriate opportunity to resume an expansion in 
the scale of the Association’s activities. 

A rise in paper prices and labor costs was chiefly responsible for the in- 
crease in the cost of the Journau. With this exception, practically all items 
of expense showed reductions from the corresponding 1941 figures. Expense 
for the Bulletin was cut approximately in half by curtailing the size of the 
issues. The total of salaries and wages was reduced more than $500. Through 
the generosity of the American University in making office space available 
to the Association, nearly $1,000 was saved on rent. Economies were effected 
in the various forms of office expense, 

The increase in the cost of back numbers of the JourNnat sold during the 
year was the result of the enlarged volume of such sales. The expenditure of 
about $300, charged to the “Survey of defense statistics” was largely for 
additional personnel assistance in the office in order to free the Secretary to 
study possible steps that the Association might take to contribute more effec- 
tively to the war effort. 

Aside from the non-recurring expenses in 1941 in connection with the 
Index to the Journal, the total expenses of the Association were reduced 
about $2,900 in 1942. As a result of this retrenchment, a balance of nearly 
$400 was carried to surplus notwithstanding the drop in income. 

The balance sheet and other detailed financial statements, with compara- 
tive figures for 1941, are included in the Report of the Auditors. 

R. L. Funxnovuser, Treasurer 
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Report of the Auditors 


To the Board of Directors of 
American Statistical Association 


We have examined the balance sheet of the American Statistical Associa- 
tion as of December 31, 1942 and the statements of income and surplus for 
the year then ended. Our examination was made in accordance with generally 
accepted auditing standards applicable in the circumstances, and included 
such tests of the accounting records and other supporting evidence and such 
other procedures as we considered necessary. 

The recorded cash receipts for the year were traced to the deposits shown 
on the bank statements and the amounts for dues and subscriptions were 
tested with the membership and subscription records. The paid checks and 
relative vouchers were inspected in support of the cash disbursements for the 
year. The cash balances and the securities owned at December 31, 1942 
were confirmed by inspection or by certificates obtained direct from the de- 
positaries. We did not eheck the membership and subscription records in 
detail or make any independent verification of the inventory of old journals, 
the office records of which are based in part on data assembled in prior years, 
no recent physical inventory having been taken. 

The life membership reserve is computed on the basis of the combined an- 
nuity table of mortality with assumed interest at 4% per annum and an 
assumed annuity of $5.00 per life member, in accordance with a resolution of 
the Board of Directors on March 31, 1936. The amount treated as income in 
each year represents the excess of the reserve at the beginning of the year 
plus interest for the year and new life membership receipts over the required 
reserve at the end of the year. 

In our opinion, the accompanying balance sheet and related statements of 
income and surplus present fairly the position of the American Statistical 
Association at December 31, 1942 and the results of its operations for the 
year, in conformity with generally accepted accounting principles applied on 
a basis consistent with that of the preceding year. 

Price, WATERHOUSE & Co. 

March 13, 1943 
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AMERIOAN STATISTICAL ASSOCIATION 
BALANCE SHEETS 
December December 











Assets 31, 1942 31, 1941 
Cash in bank and on hand...................... $ 6,547.86 $ 1,753.69 
Ns oc ow cack wn ween ane een une « 262 .84 325.61 
Investments: 
United States Savings Bonds, Series D, at redemp- 
EE ee ee eae 5,320.00 5,186.00 
Stocks, at cost (at market quotations $4,125 and 
I ios da sd wares <siee a wawa oie 5,793.50 5,793.50 
Inventory of old Journals, at approximate cost..... 1,655.58 1,779.91 
Furniture and equipment, at cost less depreciation. . 479.49 541.81 
Deferred charges (expenses applicable to subsequent 
Duce cds Rabe ee ea eke anes ea aaa et 255 .57 150.31 
$20,314.84 $15,530.83 
Liabilities 
Ps cect cubene bees eeeeeeiwenains $ 1,491.30 $ 1,183.18 
Special account—Allied Social Science Associations. 481.81 


3,618.45 


Centenary Sustaining Fund, per statement........ § 
.53 2,315.79 


Life membership reserve. ..............-.-ese08: 
Deferred credits (collections applicable to subsequent 


wn > 
wo ow 
~ © 
— bd 
co 
~l 


year): 
I ear sera alata hang Bee whale ane ae aad 4,753 .67 1,267.12 
NS tot ahaa inna igh Wha a Gna wee ae EN 1,572.67 1,690.76 
eas ews wana and aaa e Oak 41.00 
ee ee er 5,311.70 4,973.72 





$20,314.84 $15,530.83 





0 
1 
1 
1 


3 


i 


ool CO 


oa nw 


io | & 





-REPORTS OF THE ASSOCIATION 


AMERICAN STATISTICAL ASSOCIATION 


INcoME STATEMENTS 
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Year ending December 31, 

















1942 1941 
INCOME: 
en eee ee $13,765.50 $14,002.00 
na staid aware ark RS aE 70.00 31.25 
oad a pea wget wicca 166.89 192.60 
Cee eee eee ee 3,277.25 3,723.25 
a oe 950.19 832.62 
PE Ati aca ve cwetesesssceseersveesecens 404 .50 174.40 
I cecilia soksds ainsa os vero atta eo aac 635.75 293 .07 
Soden to the Journal, GRIOB. . wc... cc cccccccccss 23.50 566 .00 
Rs kv dcvasenenekeuewscunens 63.76 71.07 
bein uk ai oa awe dea en aa 110.95 128.25 
Dividends and interest (after deducting $96.48 in 
1941 and $92.63 in 1942 apportioned to life 
SUUIIUIIIND DUDE OED onc icc cccsccnsccccccsee 164.79 208 .34 
$19,633.08 $20,222.85 
Appropriation from Centenary Sustaining Fund. . 2,500.00 
The Rockefeller Foundation grant for Index to the 
EE GE GINS Kix h oackeeasbevancnon'ss 912.74 
$19,633.08 $23,635.59 
EXPENSES: 
Journal—printing, mailing and reprints......... $ 4,852.81 $ 4,722.89 
deeb sassenes ive baebeiededeeNnee<oens 726 .95 1,441.75 
Es 6 oe cekantecuwwieb bieandars 11,111.07 11,682.29 
Unemployment compensation tax.............. 233 .40 238 . 96 
SESS ee yee bee eer eran rer neee 955.00 
Office supplies, printing and mimeographing..... 470.11 932.76 
General postage and carriage.................- 454.40 710.04 
Telephone and telegraph...................-.- 141.43 155.62 
Travel expense—officers..............0.+00000: 246.49 387 .20 
Mimeographing—committees.................. 41.09 59.46 
eS Ge OE I. ce ioe ccsecnsenevecess 72.00 72.00 
Cost of old Journals, sold..............0.0.0.. 204.33 50.64 
Miscellaneous expense. ............-.eeeeeeee: 307 .58 475.35 
Depreciation of furniture and equipment........ 84.62 95.61 
Membership campaign...................++++: 170.80 
Survey of defense statistics.................... 325.32 
Expense—Index to the Journal: 
Expended from Rockefeller Foundation grant, 
IR eae aan kee alee cate aia ee ed 12.74 
Expended from Association funds............ 1.35 
$19,271.60 $23,274.46 
Excess of income over expenses, carried to sur- 
Cis atau cas Wee dad kawk aka aniee ae $ 361.48 $ 361.13 
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AMERICAN STATISTICAL ASSOCIATION 
SuRPLUs STATEMENT 


AMERICAN STATISTICAL ASSOCIATION: 


Year ending December 31, 


Balance at beginning of year. ...............c0000. 
Add—Excess of income over expenses for the year, per 
iE . cick cceeeveste dens vevuweeenes 


Deduct—Appropriation to Centenary Sustaining Fund 
of amount equal to net proceeds from sales of Index 
eee Pee oe eR ci edad 6 


RN GONE OE DUID gov i iikcns canecvuseeesaewes 





1942 
$4,973.72 


361.48 


1941 
$4, 967 .24 


361.13 





$5 , 335.20 


$5 , 328.37 


354.65 





STATEMENT OF CENTENARY SUSTAINING FunpD 


Contributions and pledges (relating to 
the support of the activities of the As- 
sociation for a period of five years from 
January 1, 1940 to December 31, 1944) 

Less—Pledges not collected at December 
31, 1942 including $1,244.90 not due at 
PE Ghani caee wekkeo sash owe es 


Amounts received from contributors... . 
Interest received on bank savings account 
and increase in redemption value of se- 
Dit secdetccesaesoakeceweinns 
Appropriation from surplus of amount 
equal to net proceeds from sales of In- 
| re 


IN. 2 Ns bane awd wenows 
Less—Expenses of campaign (printing, 
postage and temporary assistance)... 


I ats err Sig 


Appropriations by the Board of Directors 
to the general account of the Associa- 
tion in accordance with the budget: 

re $1,500.00 
ak kk aa wae 2,500.00 


Year ending 


$4,973.72 




















ars 
Total December 31, oe 
1942 - 

$10, 686.02 
1,716.25 

$ 8,969.77 $1,065.00 $7,904.77 

231.58 96 .02 135.56 

378.15 23.50 354.65 

$ 9,579.50 $1,184.52 $8,394.98 

776.53 776.53 

$ 8,802.97 $1,184.52 $7,618.45 
4,000.00 





Balance December 31, 1942, per balance 
MN aglaw ates Wace oe ac orae on eer 


$ 4,802. 
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List of Committees and Representatives for 1942 


Committee on Fellows 


Frank A. Ross 
W. Leonard Crum 
John Rice Miner 


Committee on Nominations 


Frederick F. Stephan, Chairman 


Joseph 8. Davis 
George W. Snedecor 


Committee on Investments? 


F. Leslie Hayford, Chairman 
George O. May 
Walter W. Stewart 


Biometrics Section Committee 


C. I. Bliss, Chairman 
Joseph Berkson, Secretary 
A. E. Brandt 

Alfred J. Lotka 

J. Neyman 


Census Advisory Committee 


Committee on Census Enumeration Areas 


William F. Ogburn, Chairman 


Murray R. Benedict 
Paul T. Cherington 
J. Frederic Dewhurst 


Howard W. Green, Chairman 
Clarence E. Batschelet 

W. Thurber Fales 

Ernest M. Fisher 

Shirley K. Hart 


Theodore H. Brown 
Lowell J. Reed 


W. J. Youden 

Churchill Eisenhart 

George W. Snedecor 

Frank W. Notestein, ex-officio 


Luther Gulick 
Frederick F. Stephan 
Willard L. Thorp 


Charles 8. Newcomb 
Vergil D. Reed 

Leon E. Truesdell 
Lent D. Upson 


Joint Committee on Occupational Classification (With the Division of Statisti- 


cal Standards, Bureau of the Budget) 


Howard B. Myers, Chairman 
Gladys L. Palmer, Secretary 
E. Dana Durand 

Meredith B. Givens 


Edward D. Hollander 
Merrill G. Murray 
Leon E. Truesdell 
Emmett H. Welch 





Joint Committee on Relief Statistics (With the American Public Welfare 
Association) 
Ralph G. Hurlin, Chairman 
Paul Webbink, Secretary 
Neva R. Deardorff 
Emil J. Frankel 
Anne E. Geddes 

1 With terms expiring at the end of 1942, 1943, 1944, 1945, and 1946, respectively. 


? Appointed in accordance with By-Law 5 to supervise the investment of the Association's surplus 
funds, 


Howard B. Myers 
Melvin W. Sneed 

Gaylord A. Walker 
Emmett H. Welch 
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Committee on Chapters 


Robert R. Nathan, Chairman William A. Sturm 
Henry B. Arthur Charles W. Vickery, Jr. 
Edwin B. George Robert R. Williams 


Lester 8. Kellogg 
Committee on Sampling 


Frederick F. Stephan, Chairman Alfred N. Watson 
W. Edwards Deming Samuel S. Wilks 
George W. Snedecor Theodore O. Yntema 
Advisory Committee to the National Roster of Scientific and Specialized 
Personnel 
Halbert L. Dunn Ralph W. Tyler 
Walter Mitchell, Jr. Samuel 8. Wilks 


Frederick F. Stephan 
Advisory Committee on Morbidity Statistics 
Lowell J. Reed, Chairman H. 8. 
E. L. Bishop E. S. 
Hugo Muench 
Representative on the Board of Directors of the National Bureau of Economic 
Research 
Winfield W. Riefler 
Members of the Social Science Research Council® 
Stuart A. Rice 
Edwin B. Wilson 
O. C. Stine 
Representative on the Council of the American Association for the Advancement 
of Science 
Walter A. Shewhart 
Representatives on the Joint Committee for the Development ef Statistical Ap- 
plications in Engineering and Manufacturing 
Walter A. Shewhart Churchill Eisenhart 
Representative on the Sectional Committee on Standards for Graphic Presenta- 


Mustard 
Rogers 


tion 
A. H. Richardson 

Member of the American Documentation Institute 
Richard L. Funkhouser 

Representative on the Advisory Board of the American Year Book 
Frank W. Notestein 

Representative on the Advisory Committee to the Census Library Project 
Richard O. Lang 


3 With terms expiring at the end of 1942, 1943, and 1944, respectively. 








BOOK REVIEWS 


GLENN E. McLavuGHuIn 
Review Editor 


Neue Schweizerische Sterbetafeln. Bern: Beitrige zur Schweizerischen Statis- 
tik. Heft 10. 1942. 56 pp. 

Some of the principal results of this valuable study of the Central Statis- 
tical Bureau of Switzerland (Eidenéssisches Statistisches Amt) may be re- 
produced in brief. The first chapter deals with the recent Swiss life table, 
1933/37, in comparison with the former eight covering the period from 1876 
to 1932. Accordingly, the life expectancy at birth was (in years): 


Life table Males Females 
1876/80 40 .64 43 .24 
1901/10 49 .25 52.15 
1933/37 60.68 64.65 


The last values are almost exactly the same as obtained in U.S.A. 1935, for 
the white population (60.7 and 64.7) and in England and Wales, 1937 (60.2 
and 64.4), as quoted in the international comparison of the publication. Yet 
they are still somewhat below the respective values in Holland, Denmark 
(only for males), Australia, and New Zealand for the last life table per‘ods 
(around 1935). The prolongation of life in the Swiss population during the 
sixty years covered amounts to exactly 20 years for a new born boy and 21.4 
years for a new born girl. By far the largest part of this great average gain 
was due to the decrease of infant and early childhood mortality. The abso- 
lute contributions in years for the different age groups are given in a separate 
tabulation. This contribution was for the first year of life 8.6, for age 1-4 
still 3.2 years, or together more than half of the total gain in the male sex; 
the corresponding figures in the female sex are 7.8 and 3.6 years. The next 
highest gains occur in the age group 30-39 with 1.5 and 1.7 years, respectively 
for the sexes. In contrast to this, the gain in further life expectancy for the 
age groups, 70 years and above, amounts only to 0.4 for the male and 0.8 
years for the female sex. This brings out rather definitely, also in the Swiss 
life tables, the fact that the individual life span has not very much changed 
after the hazards of childhood and maturity have passed. Individual life, 
then, still lasts ‘‘three score and ten.” 

The probabilities of dying per year are given later in extenso and on this 
basis tables of survivors comparing the dying out of a generation at different 
periods of the Swiss population. The “probable life duration” as characteriz- 
ing the age in which one-half of the starting generation has died out, has in- 
creased from 47.3 (life table 1876/80) to 67.5 years (life table 1933/37) in 
the male sex; from 51.7 to 71.5 years, respectively, in the female sex. It is 
throughout all periods larger than the mean duration of life at birth. The 
development of the most important causes of death is afterwards dealt with 
in summarizing tabulations. Of special interest is the construction of a life 
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table for lung tuberculosis and for cancer. Excluding all deaths from either 
of these causes in the life table 1929/32, life expectancy results in the follow- 
ing values (years): 


























Males Females 
Age ae 
(years) All causes Without lung Without All causes Without lung Without 
of death tuberculosis cancer of death tuberculosis cancer 
| 

0 59.17 60.71 60.66 63.05 64.85 64.60 

40 28.58 29.34 30.32 31.42 31.96 33.13 
60 13.91 14.15 15.26 15.45 15.66 16.57 














The figures demonstrate that tuberculosis, although having decreased 
enormously during the last decades, still in the life table of 1929/32 causes a 
greater loss of life for the total Swiss population than cancer.! However, in 
the middle and higher ages the further expectation of life when excluding all 
cancer deaths is distinctly greater than when excluding deaths from lung 
tuberculosis. Thus the eradication of cancer would significantly increase only 
further life expectancy at these ages. 

Further life tables are constructed for the urban and rural population. 
They show that at present city dwellers, especially women, have a somewhat 
better life expectancy than country dwellers. The last chapter deals shortly 
with mortality by marital status (single, married, widowed, divorced) and 
confirms the fact well-known from other experience that also in the Swiss 
life table married people have the most favorable, the divorced ones the least 
favorable rates of mortality. 

GrorG WoLFF 

National Institute of Health 


The Theory of Prices, by Arthur W. Marget. New York: Prentice-Hall, Inc. 
Vol. II. 1942. xxv, 802 pp. $6.00. 


To the reviewer’s knowledge, this is the biggest and most learned single 
volume of its kind, surpassing the author’s own first volume. Up to 450,000 
words are crowded into it, largely in footnotes. Every sentence is strictly 
“scientific,” often of breathtaking length and of painfully precise formulation, 
with algebraic symbols interspersed, loaded with complex terminologies. 
Virtually every statement is perfectly logical in itself, and in its very explicit 
interrelations with others. The whole is supported, in a fashion reminiscent 
of the Medieval Schoolmen, by about 5,000 imposingly exact and exhaus- 
tively analyzed references taken from at least 600 books and articles, in five 
or six languages, of some 330 authors. Of course, a brief review cannot do 
justice to such a vast enterprise. Much worth-while detail as well as some 
controversial judgments must be left without due evaluation. 

1 This would come out still more distinctly if deaths from other forms of tuberculosis were also 
considered; they amount, according to the preceding tabulations of causes of death, to almost one-third 


(average number in 1929/31: 1,242) of the deaths from pulmonary tuberculosis (average number in 
1929/31: 3,879) in Switzerland. 
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The main outcome of this extraordinary effort is two-fold. First, negative: 
to prove (by a vast quantity of quotations) the incorrectness of Mr. Keynes’ 
reproach that when economists pass from the general Theory of Value to the 
Theory of Money and Prices, “we hear no more of these homely and intel- 
ligible concepts,” namely, of demand and supply; in other words, that Mr. 
Keynes’ claim to be the first one to analyze “output as a whole” in relation 
to prices is entirely unfounded. Secondly, the positive purpose is to demon- 
strate (largely by “propositions ...of an extremely elementary nature”) 
that the Marshallian partial equilibrium analysis is just as useful as it is 
generally assumed to be. It may be used for monetary theory together with 
what Marget calls “Stream Equations,” i.e., the MV = PT type of quantity- 
theory formulas, which are called a “translation” of the “Walrasian Process” 
(p. 384). Briefly, we might go on, undisturbed by Mr. Keynes’ claims on 
originality, operate with individual demand and supply schedules as we did 
before, and satisfy our sense of formal beauty by knowing that “discrete 
situations are part and parcel of the process of receiving and expending 
money in time” (p. 349). 

Parturiunt montes: nascitur ridiculus mus. Small as the “mouse” is, it 
offers endless possibilities for neat formulations and logical exercises. Legions 
of words are spent to establish the obvious; e.g., that “prices” should mean 
paid prices, and not imagined ones; that demand curves may be understood 
“ex-ante,” etc. More or less self-evident commonplaces are elaborated by 
“corollaries” and literary polemics, and substantiated by a plethora of 
quotations. What might have been fully satisfactory as a short essay—with- 
out adding much that is new to our knowledge—is inflated to a large volume, 
one that does not claim to have any relevance to matters of policy, or to deal 
with factual analysis. It evades even methodological issues. The “value” of 
equilibrium analysis, total and partial, is discussed at length but in a dog- 
matic fashion that misses the essential point: the proper distinction between 
logical consistency on the one hand, and “realism” on the other. 

Mr. Marget’s objective is purely “formal.” His professed aim is to elimi- 
nate the “internecine” struggle between the Professors of Equilibrium Theory 
and Mr. Keynes by refuting the latter’s “unprecedented” attacks against 
the former.' Notice, however, that (a) even though successful, this refutation 
has no bearing upon any of Keynes’ theories of the actual forces underlying 
monetary phenomena, or upon the policies he pursues, since it deals with the 
formal categories or “framework” of analysis only, not with analysis itself; 
(b) Marget’s claim that a demand-and-supply approach underlies almost 
every theory of the value of money since Aristotle, implies a misunderstand- 
ing of the problems which Keynes has in mind, and which were so foreign to 
Aristotle and other “economists” of a pre-capitalistic world as the problems 
of modern thermodynamics were to pre-19th century physics. 


1 Mr. Marget’s assumption that no such “internecine” quarrel has ever taken place between (aca- 
demic) theorists, is altogether erroneous. He apparently overlooks the violent attacks of the Austrian 
school on the Classics, or of the “Mathematicians” on the “Literary Economists,” to say nothing of 
the fights between Marxist and anti-Marxist theorists. 
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The reviewer, who labored conscientiously through this heavy volume, 
cannot refrain from expressing his regrets about the truly uneconomic wast- 
ing of Professor Marget’s analytical talent and colossal erudition on com- 
bating the entirely irrelevant attempts of Mr. Keynes at originality. 

Chicago MELCHIOR Paty1 


The Theory of Competitive Price, by George J. Stigler. New York: The Mac- 
millan Company. 1942. vii, 197 pp. $3.00. 


“This is a preliminary edition of a textbook in advanced economy theory.” 
The scope of the present edition is narrowly circumscribed. Part I—Intro- 
duction—includes a discussion of the scope and method of economics, an 
outline of the functions of the economic system, a discussion of certain basic 
concepts (competition, statics and dynamics, and equilibrium), and an 
exposition of the relations between the various quantities used in marginal 
analysis. Part II—Theory of Competition—covers the theory of demand 
developed in terms of indifference curves, the nature of costs and the produc- 
tion function, and the pricing of products and productive services under 
conditions of perfect competition. The author expresses a hope “In the not- 
too-distant future ... to add sections on imperfect competition, multiple 
products, capital theory, and certain other topics which usually receive 
attention in advanced theory courses.” 

The author shows a keen sense of balance in the choice between alternative 
doctrines and a judicious sense of pedagogy in the choice between alternative 
methods of exposition. The book is simply and concisely written. Mathe- 
matical concepts are illustrated by both numerical and geometrical examples. 
In addition there is considerable symbolic analysis which is conveniently 
separated from the body of the text. The principal limitation of the book, in 
addition to the narrow scope of the present edition, is that shared by most 
texts on modern economic theory, namely that it leaves the impression that 
there are no open issues. Nevertheless, the reviewer looks forward eagerly 
to subsequent enlarged editions. 

Chapters 7-10, devoted to the nature of costs and the production function 
and to pricing under competitive conditions, are the best chapters on the 
subjects available for students. Chapter 9 on pricing under competition relies 
heavily upon the Marshallian theory of time analysis modified, as has been 
urged so persuasively by Professor F. H. Knight, by consideration of specu- 
lative price. It is unfortunate, however, that greater attention is not given to 
distinguishing fixed and variable costs. There are many problems attached to 
this distinction which would be brought to the fore by an operational discus- 
sion of costs as seen by the individual firm. A fruitful exposition of short-run 
normal prices should consider the path by which the individual firm expands 
and contracts its plant and output. Such a discussion would raise the ques- 
tion of the short-run reversibility of the marginal and average curves and 
would show the limited usefulness of the concept of the long-run average cost 
or envelop curve. 
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Some will question the desirability of approaching the pricing problem via 
the theory of pure competition, although the author presents a persuasive 
defense of his choice. Others, including the reviewer, will regret that as now 
conceived the later editions will evidently ignore many problems of modern 
theory, including the theory of general equilibrium, the theory of income 
flow, problems of economic dynamics, and the theory of economic welfare. 
The theory of demand is an arid subject except as related to the theory of 
economic welfare or the theory of general equilibrium. The theory of pricing 
begins to take on significance to most students only when related to the con- 
cept of national income or to some so-called “applied” problems. The sig- 
nificance and limitations of partial equilibrium theory are not fully under- 
stood until related to the theory of general equilibrium and the theory of 
income flows under conditions of disequilibrium. Surely such subjects are 
within the scope of a course in advanced theory. It is hoped that the present 
conception of the scope of subsequent editions will be extended beyond that 


now indicated. J. P. MrLuer 


Yale University 


War Economics, edited by Emanuel Stein and Jules Backman. New York: 
Farrar & Rinehart, Inc. 1942. x, 501 pp. $3.00. 


This is a volume which is concerned with the operation of the economic 
organization in war time and the effects of war upon our economy rather 
than with the economic causes of war. After noting the stages by which the 
peace economy is converted into a war economy, the authors proceed to dis- 
cuss the organization of industrial production, agriculture, transportation, 
electric power, and foreign trade for the conduct of the war. Other chapters 
deal with the problems of war financing, price control, housing, business 
finance, and consumption. The problems of labor also receive attention. The 
closing chapter discusses the problems of industrial demobilization after the 
war. 

The volume provides a clear and convenient outline of the economic prob- 
lems attending total war and studies the various alternative approaches to 
these problems. The necessity of planning for total war is emphasized, but 
the problems which will follow in the wake of the global conflict are not neg- 
lected. Much useful information has been assembled by the authors which 
is descriptive of the steps taken in this and other countries in the effort to 
prosecute the war to a successful conclusion. Fortunately for us, many les- 
sons may be learned from the experience of the other belligerents. 

As one reads the volume he gains an appreciation of the immensity and 
complexity of the battles of production, transportation, financing, and 
marketing. It should prove helpful and interesting to students of economics 
and should provide a ready text and reference for those who wish to inform 
themselves of measures taken in organizing our economy for the war and the 
problems which face us. A wide range of topics is covered in a book which is 
not large but which contains a great deal of information. 
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On particular points the reader may wish to disagree or he may feel that 
some favorite theory has been neglected. Not every topic of importance in the 
economics of war could be treated, but the authors have succeeded in pre- 
senting a surprisingly comprehensive description and discussion of our 
economy in war time. ALrrep G. BUEHLER 

University of Pennsylvania 


The Women Graduates of a Collegiate School of Business, by Ann Brewington 
and Evelyn Van Emden Berg. Chicago: The University of Chicago Press. 
Studies in Business Administration, The School of Business. 1942. ix, 99 
pp. $1.00. 

“The purpose of this study is to record some of the occupational and 
personal experiences of the women graduates of a collegiate school of busi- 
ness.” A twelve page questionnaire was mailed to the women graduates of 
the University of Chicago’s School of Business. Herein are recorded the re- 
plies of 245 of the 462 women who were graduated between January 1913 
and January 1941. The recording takes some sixty pages of text, thirty-five 
pages of appendix, and patience. 

The reporting group seems to be representative of the known characteris- 
tics of the original 462 graduates at the time of graduation, no claim being 
offered that the group is representative of anything more than these gradu- 
ates. The result is a listing of miscellaneous information, of interest primarily 
to those persons who are curious about the women graduates of the Uni- 
versity of Chicago’s School of Business. Even here, however, generalization 
is difficult, for the findings are often uncontrolled for such primary refine- 
ments as age at graduation, and date of graduation. 

Five chapters are presented under the rubrics “Educational Preparation 
of the Women Studied,” “Careers,” “Finances,” “The Personal Equation,” 
and “A Long Look Ahead.” Each chapter intones a procession of absolute 
and relative figures calculated to impress the reader with the unexceptionable 
character of these girl graduates, their varied careers, sundry financial condi- 
tions, variegated personalities, and the fact that they face the future with 
mixed feelings of hope and fear. 

The reader may judge from these glyptic globules just how well the obvious 
has been elaborated: 

“The 53 homemakers have been married an average of 7.9 years, and their 
average age is 34 years. .. . The value of their School of Business training 
is evidenced by the fact that some occasionally assist their husbands. . . 
24 are always consulted by their husband about business problems, and 18 
are sometimes consulted... .” 

“College education in business widens employment opportunities.” 

“Application of professional knowledge and skills to insignificant and 
boring routines in business, as well as in housekeeping, results in happiness.” 

“Salaries increase with experience.” 

“Women should help one another by maintaining a cooperative attitude 
toward one another.” 
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It would be generous to close with this last bit of wisdom: “This is not a 
man-made world but a mismade world, and the fault is not entirely that of 


man.” 
Davip YENTIS 


Federal Public Housing Authority 


How Collective Bargaining Works. New York: The Twentieth Century Fund. 

1942. xxviii, 986 pp. $4.00. 

According to the foreword, “This volume contains full-length sketches of 
the actual workings of collective bargaining in sixteen United States trades 
and industries and thumbnail summaries covering thirteen other fields.” 
Sixteen authors have cooperated, under the direction of Dr. H. A. Millis, 
to provide fourteen industry and trade essays. Philip Taft has, in addition, 
summarized the development of collective bargaining in this country and 
outlined its present status in thirteen additional industries. A final section of 
the appendix presents available data as to membership of A. F. of L., C.I.0., 
and unaffiliated national and international unions. 

A fairly consistent pattern is discernible in the treatment accorded each 
of the industries. Authors first present a more or less comprehensive descrip- 
tion of the economic characteristics of each industry, in terms of volume of 
output and employment, location, competitive features, special technological 
conditions, major labor problems, and any distinctive economic aspects or 
notable recent trends. There follows a description of the development of 
collective bargaining in the industry, with special attention to historic strug- 
gles, the structure, functions, and present strength of unions involved, and 
to general and distinctive features of trade agreements developed or in force 
in the industry. Employers’ associations, if any, receive attention at this 
point. The next section considers the results of collective bargaining, with 
special reference to changes in working hours, rates of pay, the settlement 
of disputes, special work rules, effects of collective bargaining on unit costs, 
and union cooperation in meeting competition. A final division is devoted to 
appraisal, by the author or authors, of the over-all effects of collective bar- 
gaining in the industry, including a description of unsolved problems. 

The studies thus presented will provide valuable background and perspec- 
tive for all students of modern industrial relations. They are not uniformly 
well done, and no single criticism applies equally to all of them. Several of 
the industry studies tend, in the opinion of this reviewer, to forget that 
unionists remain a distinct minority in the nation’s labor force, and that 
collusion with employers may advance union members’ interests at the direct 
expense of the public. One lesson appears clearly; that is the utter necessity 
for intelligent, broadminded, powerful union leadership if collective bargain- 
ing is to become anything more than the socially dangerous tool or device 
of a selfish interest group. 

Date YODER 


University of Minnesota 
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Determining Work Loads for Professional Staff in a Public Welfare Agency, 
by Herbert A. Simon et al. Berkeley: University of California, Bureau of 
Public Administration. 1941, xii, 94 pp. $1.00. 


This monograph describes a study to determine the size of workload in a 
public assistance agency which will result in the highest proportion of eligible 
cases, together with satisfactory service to recipients, at the lowest total 
cost. The study was conducted in two districts of the Los Angeles County 
Emergency Relief Administration by the Bureau of Public Administration 
of the University of California. 

The experiment studied three steps in the performance of the agency 
function: 

1. The intake or “qualifier” interview. 

2. Home and collateral visits by the field intake worker. 

3. The reinvestigation of active cases or “carrier” load. 

Careful controls were maintained to establish comparable conditions in 
the workloads which varied in size from the current average in the agency to 
points substantially above and below it. Of special interest was the adjust- 
ment period of a month ailowed the investigators with carrier loads so that 
no backlog of work would distort the findings. The maintenance of the carrier 
loads at a fixed number offered a stability in workload which is seldom dupli- 
cated in actual public welfare practice and may therefore be questioned. 

The most reliable test of eligibility was considered to be in the number of 
cases rejected as ineligible after acceptance. Results were measured by the 
analysis of special statistical forms kept on each case, by a social audit on a 
sample basis which included analysis of case records and visits to recipients 
and collaterals and by a time study. To this reviewer, the visits to clients 
and collaterals by the social audit group as a check on the experiment seem 
both costly and socially undesirable. 

The findings were more satisfactory in measuring the determination of 
eligibility than the adequacy of service. The latter was assessed largely in 
terms of referrals to other agencies. Whether lower workloads resulted in 
better worker-client relationship, and therefore, a more constructive experi- 
ence for the client, is difficult to ascertain and this was recognized by the 
study group. 

An interesting aspect of the findings is that the intake or qualifying inter- 
view showed the greatest gain with a lower workload, the active, or carrier 
load, was next, and the intake visiting in the field showed the least gain with 
lower loads. This again emphasizes the value of careful intake interviews and 
the average of 30 to 40 minutes allowed with the low workload is not im- 
practical. 

The findings of this study agree with those of similar studies in other parts 
of the country, namely, that lower workloads than usually exist in public 
welfare agencies would cost less and offer better service to agency recipients. 
Most public welfare administrators know this from practical experience 
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without the benefit of a scientific study but factors beyond their control often 
prevent their acting on this knowledge. The size of workload for maximum 
efficiency in most public assistance agencies can perhaps be determined as 
satisfactorily on the basis of experience, observation and analysis of local 
factors affecting size of workload as by the more elaborate and costly study 
method. 

The Los Angeles study has a particular contribution in this field because 
it so carefully details the methods used and the various elements which must 
be considered in reaching the findings. This emphasis on methodology should 
render it particularly valuable to the public welfare administrator who finds 
it necessary to conduct a similar study. 

CoNnsTANCE HasTINGs 

Social Security Board 
Random Sampling Distributions, by Alan E. Treloar. Minneapolis: Burgess 

Publishing Company. 1942. 94 pp. $2.25. 

This book is a planographed set of notes rather than a formal textbook. 
It is intended as an aid to the serious student of science who, requiring 
significance test techniques, is neither acquainted with their derivation nor 
satisfied to use formulas blindly. With this reader in mind, the author relies 
upon simple algebraic formulations of the argument, largely in terms of the 
moments of the distributions, and a liberal appeal to the geometry of the 
problems, presented in 24 drawings. Since one or both of these approaches 
is within the grasp of most science students, the book should prove to be a 
real contribution to them. 

The distributions dealt with are those concerned with significance tests 
on means, standard deviations, and correlation coefficients. In addition to 
the text discussion around the development of the relevant distributions, the 
four following appendix tables are given: a table for estimating o from s on 
the basis of assumptions concerning the position of s in its random sampling 
distribution; 5 per cent and 1 per cent points for the distribution of F (re- 
produced from Snedecor’s text); table of the probability that t, based on n 
degrees of freedom, will be exceeded; tables of z, as a function of r (repro- 
duced from the author’s text). 

Of particular value to the applied student approaching these significance 
tests is the emphasis given to the assumptions from which the algebra flows. 
These are not only stated explicitly but the limitations which they place 
upon the interpretation are stressed. It is pointed out that most of our exact 
sampling distributions rest on an assumption that the universe sampled is 
normal (an assumption that cannot be verified in most actual problems) and 
therefore the sampling distributions may kt: only approximate for the prob- 
lem in which they are used. Thus the author questions “the value of parading 
that exactness feature of the test when it rests on an assumption that is an 
approximation. ” 
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Various controversial issues with regard to the importance of the signifi- 
cance test in statistical reasoning, and the interpretation of the probability 
attached to a given test are discussed. With regard to the latter, Dr. Treloar 
does not allow a fractional probability to be attached to the statement that 
two parameters differ, or that a given parameter lies within a specified range. 
This probability he allows to be either 0 or 1. Rather, he uses the probability 
in a subjective sense, to measure the degree of confidence we have in the 
truth of the statement. However, in the discussion of the null hypothesis, 
this interpretation does not seem to be strictly maintained, for we have the 
statement, “If that probability is less than 50 per cent then the odds may be 
said to favor the difference not having arisen from random sampling, and 
one is therefore disposed to cast doubt on the accuracy of the null hypoth- 
esis.” The first part of this statement is an inverse probability statement, 
yet without any consideration of existence probabilities. It seems to be out 
of line with the rest of the discussion of interpretation of significance tests. 

The student will find this a book which not only lays a substantial ground- 
work for the tests he has frequent occasion to use, but also one in which real 
consideration rather than a pat answer is given to the meaning of the tests. 

MARGARET MERRELL 

School of Hygiene and Public Health 

The Johns Hopkins University 


Poisson’s Exponential Binomial Limit, by E. C. Molina. New York: D. Van 

Nostrand Company, Inc. 1942. 47 pp. $2.75. 

The book contains three pages of introduction, one of which, explaining 
some inspection problems, is signed by H. F. Dodge; and 47 pages of tables. 
These are divided into two parts. Table I gives values of the individual terms 
of the Poisson Law, ' 

p(x| a) =e~*a?/z! 
and Table II the values of the probabilities of a Poisson variable being at 


least equal to c; thus - 
P(c, a) =>_p(z} a). 


Beginning with a =.001 and up to a=.010 the tables proceed by increments 
of a equal to .001. Further on and up to a=.30 the increment is .01. Still 
further and up to a=15.0 the increment is .1 and beyond a =15.0 it is one 
unit. The last columns of both tables correspond to a =100. The number of 
decimals given is also varying. Up to a =.40 the figures given are with seven 
decimals, beyond that limit with six. 

As far as the reviewer is aware the present table is more extensive than 
any other table of the Poisson Law in existence. In spite of the wider and 
wider applications of the Poisson Law, its available tables were limited to 
those given in K. Pearson’s Tables for Statisticians and Biometricians, Part I. 
Of the tables given there, Table LI gives the values of individual terms of the 
Poisson Law for a<15, proceeding by increments of .1 and Table LII the 
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sums of such terms, for a $30, proceeding by increments of 1. However, the 
arrangement of Table LII, meant for one specific purpose, is somewhat 
awkward to use in other cases. 

Of course the numerical values of the Poisson Law for higher values of a 
can also be obtained from the extensive ‘Tables of the Incomplete T-Function 
edited by Karl Pearson (Biometrika Office, London, 1934). However, that 
table was again mainly meant to serve purposes other than applications of 
the Poisson Law, namely those encountered in work with the Pearson’s 
Type III frequency curve. These applications and also difficulties in setting 
up the table have forced Pearson to use a system of arguments which con- 
tributes somewhat to the inconvenience in using the Tables of the Incomplete 
I'-Function for purposes of computing the values of the Poisson Law. 

As a result of all these circumstances, the volume under review is a most 
welcome addition to the growing arsenal of statistical tools. There is no 
doubt whatsoever that in a relatively short time it will find its place on the 
shelves and desks of all statistical workers, exhausting the first edition. We 
may hope that then the Van Nostrand Company will become convinced of 
the importance of this particular publication and will re-issue it in a proper 
printed form. The present edition is lithoprinted and, in places, regretfully, 


none too well. 
J. NEYMAN 


University of California 


Encyclopaedia Britannica World Atlas. Chicago: Encyclopaedia Britannica. 
1942. Vol. XI. 223 map plates, 272 pages of text and tables, 128 pages of 
map index. 

Although the publication of this atlas by the Encyclopaedia Britannica 
marks an event of interest in the field of cartography the volume is primarily 
noteworthy for the scope and quality of its statistical contribution, a contri- 
bution representing a new departure for this estimable research establish- 
ment. For this innovation credit is due G. Donald Hudson of Northwestern 
University. This timely document merits careful study by those who may 
exert, or hope to exert, influence in rehabilitating the war-torn world. Only 
by giving conscientious consideration to such carefully compiled world-wide 
factors can a lasting peace be built. 

This atlas provides a useful series of world, hemisphere, regional, national, 
and, for the United States, state-county political maps showing place names, 
boundaries, lakes, rivers, canals, and railroads. In addition there are eight 
eye-pleasing physical maps showing elevations by tint bands. These political 
and physical maps, comprising over 200 plates with a number of large scale 
inserts, were prepared for Britannica by C. S. Hammond & Company to 
show the status of the world on September 1, 1939. 

“Geographical summaries” gives the new statistical setting. The year 1938 
was selected for the base, although statistics are provided when possible for 
subsequent years. This tabular presentation, covering 237 atlas sized pages, 
divides the world into seven continental regions, namely, North, Middle and 
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South America, Europe, Asia, Africa and Australasia and the Pacific Islands, 
For each region a short page or two of descriptive orientation introduces the 
statistics which follow, statistics carefully selected through an exhaustive 
examination of governmental and institutional sources. Over 500 of these 
sources are detailed in valuable regional bibliographies. The first series of 
nine tables group all countries of a region together. Only the very smallest 
countries are omitted. These tables provide ready comparisons between 
countries. The following information is given by countries grouped for each 
region: area, population and type of government; leading agricultural, min- 
eral and manufactured products; dollar value, major commodity and major 
source of imports and exports; communications facilities; transportation 
facilities; value of currency in dollars; government expenditure for military 
purposes; military service and personnel; and military equipment and 
organization. The regional tables are followed by fifteen more for each of 
165 countries, or approximately 2,400 individual tabulations. 

The atlas is concluded with a short section of 22 pages containing maps and 
compilations of data for “world spheres of influence,” or what might have 
been more appropriately called “areas of control.” Spheres of influence are 
shown individually as they existed for the United States, Great Britain, 
France, Germany, Spain, Japan, Italy and Portugal on August 1, 1942. The 
usefulness of this series of maps and accompanying data might have been 
greatly enhanced if they had conformed to a more significant date. Though 
it might be desirable to make all sections of an atlas as current as possible, 
still it seems unlikely that August 1, 1942, the last possible date prior to 
publication, will have any long run significance. Compilations whether carto- 
graphic, statistical or explanatory when keyed to a date falling into the un- 
certain turbulence of the rapidly changing events of a world war, partake a 
momentary character akin to a newspaper statement. This unfortunately 
is true no matter how painstakingly accurate the treatment of the material. 
It would seem, therefore, that the choice of a date just prior to the partition- 
ing of Czechoslovakia or the invasion of Poland by Germany might have 
resulted in a series of maps and compilations destined to serve more sig- 
nificantly as a reference base in planning and intelligently appraising the 
peace settlement at the close of this war. M. J. Prouproor 


War Production Board 


Distribution of Ownership in the 200 Largest Nonfinancial Corporations, by 
Raymond W. Goldsmith, Rexford C. Parmelee, Irwin Friend, James 
Gorham and Helene Granby. Temporary National Economic Committee. 
Monograph No. 29. Washington. xviii, 1,557 pp. 

Survey of Shareholdings in 1,710 Corporations with Securities Listed on a 
National Securities Exchange, by Helene Granby. Temporary National 
Economic Committee. Monograph No. 30. Washington. xviii, 258 pp. 


Monograph 29 aims to show the degree of concentration of stock owner- 
ship in the largest American corporations. Only 144 of the 1,557 pages are 
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devoted to a discussion of the material, though some additional comments 
appear in Appendix I and II. The rest of the book consists of tables showing 
the size of stockholdings in the 200 largest corporations. In all, 408 security 
issues are analyzed. Much of this information is in the form of frequency 
distributions, showing the number of shares held in each of many different 
classes representing the size of the stockholding. These distributions are 
prepared both with classes representing the numbers of shares per share- 
holding and with classes representing their value. Succeeding tables analyze 
these data from many points of view. 

There are also tables showing the number of shareholders that are indi- 
viduals, corporations, trusts, etc., while another section shows the details of 
the shareholdings of officers and directors of each of the 200 corporations. 
An interesting set of tables shows the names of the twenty largest share- 
holders of each of these corporations; this is shown both for the twenty 
largest shareholders of record and for the twenty largest legal and beneficial 
shareholders. Data are presented on shareholdings of the Mellon, Rocke- 
feller, Du Pont, and other families. 

The text is devoted to discussion of the sources of data, methods of an- 
alysis, and conclusions and “implications.” One of the most interesting parts 
of the text is the comparison of the degree of concentration of ownership of 
different companies in the same industrial division. For each of many 
branches of industry, graphs are drawn presenting Lorenz curves for indi- 
vidual corporations. 

The degree of concentration of ownership as shown by these curves and 
other data appearing throughout the text is indeed pronounced. It is pointed 
out that it is not the directors or management that hold this high degree 
of power, but usually the family of the founders—in its own name and in 
trusts, estates, and personal holding companies. In the case of most families, 
the fortune has been retained through reinvestment in the securities of the 
original company. Branching out into other lines of enterprise is noted only 
in the case of the Mellon family. 

The monograph is well worked out as a study of the concentration of 
power. It is also valuable for students of corporation finance and invest- 
ments. There is a little/too much emphasis on statistics as such; as with 
other TNEC studies, the data themselves do not always point to as many 
horrors as we suspect the authors would have us believe. 

Monograph 30 “provides a background for, rather than a supplement to, 
the companion study...” It is an analysis of the 1,710 corporations, fi- 
nancial and nonfinancial, which have securities listed on national security 
exchanges, whereas the preceding monograph studied the 200 largest non- 
financial corporations whether or not listed. The analysis of concentration 
of shareholdings is comparable with that in Monograph 29. However, be- 
cause of the larger number of cases, it was possible in many instances further 
to subdivide the industries into which the companies fall. Also, subdivisions 
were made to disclose the distribution of shares in widely-held corporations 
as contrasted with closely-held corporations. 
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As in the previous monograph, the tabular presentation takes up by far 
the larger portion of the book. Lorenz curves corresponding to those in 
Monograph 29 are used to portray the concentration of stockholdings, 
Instead of a curve for each corporation, however, only curves representing 
groups of corporations are presented. These groupings, corresponding to 
classifications in the tabular material in the appendix, are based on the 
industry with which the corporation is identified and on its size as measured 
by amount of assets. 

Throughout this monograph the emphasis is on totals rather than on 
individual companies. Logically, Monograph 30 should have come first; as 
the introduction says, the study provides a broad basis on which analysis 
of individual companies, their directors, and their shareholders is built in 
Monograph 29. 

Raupu C. Epstein 
Joun H. Myers 
The University of Buffalo 


Merchandise Availability in Utah, by David E. Faville, Dix M. Jones and 
Richard B. Sonne. Stanford University: Stanford University Press. 1942. 
xvi, 229 pp. $3.00. 

This is a study of merchandising in rural Utah. It is the result of a survey 
conducted by the Stanford Graduate School of Business at the request of a 
“far-seeing group of Western Merchants (and with their financial backing) 
to ascertain whether similar income groups living in different localities, 
especially in small villages as contrasted with larger towns, are enabled to 
enjoy to the same degree the various comforts of life, or whether in certain 
districts higher prices and a limited range of merchandise result in various 
scales of living.” No particular area was designated as the field for study, 
but Utah was seiected primarily because of its distinctive rural villages, its 
population characteristics, and its western location. 

The report deals mainly with the availability and retail price variations of 
commonly used food, drugs, clothing, dry goods and miscellaneous items. 
It tells where and why different groups did their purchasing, whether pur- 
chases were made locally or out of town, and for what reasons; whether the 
stores were independent, corporate chains or contract chains. The authors 
tell us whether or not the stores surveyed were neat, clerks courteous, service 
prompt; whether there was a good selection of merchants available and 
whether prices for advertised goods were uniform in competitive stores in 
the same community. 

The original study was conducted by the Survey Method in which house- 
wives and store operators in selected rural communities were interviewed. 
In all, 2,156 consumer interviews were obtained over a period of four months, 
1,122 in the rural towns and 1,034 in the control cities. This represented 
approximately a 25 per cent sample of both areas. The survey also included 
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interviews with 170 store operators in the four control cities and 57 in rural 
towns plus detailed checking of shelf prices at the stores. The questionnaires 
thus obtained were then tabulated on International Business Machine cards. 
These cards were sorted according to various factors and the results analyzed 
statistically. 

The report, therefore, contains many interesting tables and charts de- 
signed to reveal the vast amount of significant findings that came out of 
the survey. 

In the price section some attempt was made to determine the reliability 
of price variations by correlation and significant differences technique. Some 
interesting relationships were obtained where sufficient data were available. 
However, in some cases apparent differences proved to be insignificant; 
hence no reliable conclusions could be drawn. 

The more important conclusions were as follows: “In so far as the ready 
availability of a wide range of drug, variety, clothing, and dry-goods articles 
bears on the enjoyment of the various comforts of life, the residents of small 
rural towns and villages interviewed in this survey were at disadvantage as 
compared with their neighbors in medium sized towns. Limited selection, 
high prices, or complete unavailability were common for these items in the 
rural districts, and these deficiencies were most keenly felt by farmer and 
laborer groups.” They also concluded that “the rural-town stores surveyed 
did not commonly carry a variety of fresh fruits, vegetables, and meats.” 

The survey appears to have been well planned and ably executed. The 
results were thoroughly analyzed and written up in an interesting style. The 
few criticisms heard by the writer from others who have read the book might 
be summed up in the slang phrase, “so what?” In other words, the job was 
well done and the story well told, but relatively little new light has been 
thrown on the screen. 

DitwortH WALKER 

University of Utah 
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